P/«  12/1 


AD-A11«  236 
UNCLASSIFIED

WISCONSIN UNIV-MADISON MATHEMATICS RESEARCH CENTER
SIMULTANEOUS SIMILARITY OF MATRICES (U)

MAR 82  S FRIEDLAND
MRC-TSR-2345

DAAG29-80-C-0041


MRC  Technical  Summary  Report  #2345 




SIMULTANEOUS  SIMILARITY  OF  MATRICES 


Shmuel  Friedland 


Mathematics  Research  Center 
University  of  Wisconsin-Madison 
610  Walnut  Street 
Madison,  Wisconsin  53706 

March  1982 

Received  October  2,  1981 


Approved  for  public  release
Distribution  unlimited 


Sponsored  by 

U.  S.  Army  Research  Office 
P.  O.  Box  12211
Research  Triangle  Park 
North  Carolina  27709 




UNIVERSITY  OF  WISCONSIN  -  MADISON 
MATHEMATICS  RESEARCH  CENTER 

SIMULTANEOUS  SIMILARITY  OF  MATRICES 

Shmuel  Friedland†


Technical  Summary  Report

March  1982

ABSTRACT


In this paper we solve completely and explicitly the long standing problem of classifying pairs of $n \times n$ complex matrices $(A,B)$ under the simultaneous similarity $(TAT^{-1}, TBT^{-1})$. Roughly speaking, the classification decomposes to a finite number of steps. In each step we consider an open algebraic set $M^0_{n,2,r,\rho} \subseteq M_n \times M_n$ ($M_n$ is the set of $n \times n$ complex valued matrices). Here $r$ and $\rho$ are two positive integers. Then we construct a finite number of rational functions $\phi_1,\dots,\phi_s$ in the entries of $A$ and $B$ whose values are constant on all pairs similar in $M^0_{n,2,r,\rho}$ to $(A,B)$. The values of the functions $\phi_i(A,B)$, $i = 1,\dots,s$, determine a finite number (at most $\kappa(n,2,r)$) of similarity classes in $M^0_{n,2,r,\rho}$. Let $S_n$ be the subspace of complex symmetric matrices in $M_n$. For $(A,B) \in S_n \times S_n$ we consider the similarity class $(TAT^t, TBT^t)$, where $T$ ranges over all complex orthogonal matrices. Then the characteristic polynomial $|\lambda I - (A+xB)|$ determines a finite number of similarity classes for almost all pairs $(A,B) \in S_n \times S_n$.

SIGNIFICANCE  AND  EXPLANATION


Consider the following linear singular differential system

(1)  $\frac{dx}{dt} = (A + B/t)\,x.$

Here $x = (x_1,\dots,x_n)^t$ is a column vector and $A = (a_{ij})_1^n$ and $B = (b_{ij})_1^n$ are $n \times n$ complex valued matrices. A linear change $y = Tx$ transforms (1) to

(2)  $\frac{dy}{dt} = (A_1 + B_1/t)\,y, \quad A_1 = TAT^{-1}, \quad B_1 = TBT^{-1}.$

Thus we are led to study the properties of pairs of matrices $(A_1,B_1)$ which are simultaneously similar to $(A,B)$. It can be shown that more complicated singular differential systems under more general transformations lead again to a certain similarity class of pairs of matrices.

We now give another example of simultaneous similarity. Let $A$ and $B$ be real symmetric. Then $A$ and $B$ can represent the potential and kinetic energies of some physical system. By changing to another orthonormal basis we get another pair $(A_1,B_1)$,

(3)  $A_1 = TAT^t, \quad B_1 = TBT^t,$

for the corresponding orthogonal matrix $T$. The class of all such symmetric pairs $(A_1,B_1)$ gives all representations of the same physical system. Thus classifying such a similarity class is equivalent to classifying the physical system up to a choice of an orthonormal basis.

In this paper we solve completely the first mentioned problem. The classification of the similarity classes $(TAT^{-1}, TBT^{-1})$ decomposes to a finite number of steps. In each step we consider an algebraic variety $V$ (a set of points defined by polynomial equations) of pairs of matrices. We construct a set of invariant functions, a sort of "generalized eigenvalues" of the pairs $(A,B)$ lying in $V$. Those "eigenvalues" classify all similarity classes of $(A,B)$ except for some algebraic subvariety $W$ in $V$. We now repeat the process for $W$.

In the case of symmetric pairs $(A,B)$, we do as follows. With each such pair we associate the characteristic polynomial $p(\lambda,x) = |\lambda I - (A+xB)|$. Clearly, any similar pair $(A_1,B_1)$ has the same characteristic polynomial. In this paper we show that for almost all symmetric pairs $(A,B)$ the characteristic polynomial $p(\lambda,x)$ determines the orthogonal similarity class up to a finite number of possibilities. We strongly believe that our results will have some applications to the invariants of singular differential systems as well as to other areas, in particular mathematical physics.
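
The invariance of the characteristic polynomial under orthogonal similarity can be checked on a small case. The following is only an illustrative sketch for $2 \times 2$ matrices: the helper names (`matmul`, `transpose`, `pencil_coeffs`) and the sample matrices are assumptions made here, not taken from the report. It uses exact rational arithmetic and the expansion $p(\lambda,x) = \lambda^2 - (\operatorname{tr}A + x\operatorname{tr}B)\lambda + \det A + x\,c + x^2\det B$ with $c = a_{11}b_{22} + a_{22}b_{11} - a_{12}b_{21} - a_{21}b_{12}$.

```python
from fractions import Fraction as F

def matmul(X, Y):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def transpose(X):
    return [[X[j][i] for j in range(2)] for i in range(2)]

def pencil_coeffs(A, B):
    """Coefficients (tr A, tr B, det A, mixed, det B) of |lambda*I - (A + x*B)|.

    For 2x2 matrices:
    p = lambda^2 - (tr A + x tr B) lambda + det A + x*mixed + x^2 det B.
    """
    tr_a = A[0][0] + A[1][1]
    tr_b = B[0][0] + B[1][1]
    det_a = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    det_b = B[0][0] * B[1][1] - B[0][1] * B[1][0]
    mixed = (A[0][0] * B[1][1] + A[1][1] * B[0][0]
             - A[0][1] * B[1][0] - A[1][0] * B[0][1])
    return (tr_a, tr_b, det_a, mixed, det_b)

# Hypothetical sample data: a rational rotation Q (Q Q^t = I) and a symmetric pair.
Q = [[F(3, 5), F(4, 5)], [F(-4, 5), F(3, 5)]]
A = [[F(1), F(2)], [F(2), F(3)]]
B = [[F(0), F(1)], [F(1), F(5)]]

A1 = matmul(matmul(Q, A), transpose(Q))
B1 = matmul(matmul(Q, B), transpose(Q))

# The transformed pair is again symmetric and has the same coefficients.
print(pencil_coeffs(A, B) == pencil_coeffs(A1, B1))
```

The converse direction, that these coefficients pin down the orbit up to finitely many possibilities for almost all symmetric pairs, is the substance of the report and is not something this check can show.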


The  responsibility  for  the  wording  and  views  expressed  in  this  descriptive
summary  lies  with  MRC,  and  not  with  the  author  of  this  report.

SIMULTANEOUS  SIMILARITY  OF  MATRICES
Shmuel  Friedland†

0.  Introduction. 

Let $M_n$ denote the set of $n \times n$ complex valued matrices and $G_n$ the group of all invertible matrices in $M_n$. It is a classical problem to classify the similarity classes (orbits) of $(m+1)$-tuples of matrices $(A_0,\dots,A_m)$ under the action of $G_n$:

(0.1)  $\operatorname{orb}(A_0,\dots,A_m) = \{(B_0,\dots,B_m),\ B_i = TA_iT^{-1},\ i = 0,\dots,m,\ T \in G_n\}.$

See Gelfand [1970], Gelfand-Ponomarev [1969], Brenner [1975], Nathanson [1980], Procesi [1976] and Friedland [1980] for certain problems in which the classification of such orbits is needed, for various results on this problem and for additional references. It is known that the classification of similarity classes of $(m+1)$-tuples can be reduced to the classification of simultaneous similarity of pairs of matrices $(A,B)$. In fact one can assume that $A$ and $B$ are commuting and even nilpotent. See Gelfand-Ponomarev [1969] and Nathanson [1980]. Therefore in certain cases, for the simplicity of the exposition, we deal with the simultaneous similarity of pairs of matrices ($m=1$). In cases when the choice $m=1$ does not simplify the treatment of the problem we deal with an arbitrary $m$. In Friedland [1981] we outlined a general procedure for classifying the orbits of a given algebraic group acting on an irreducible variety over any algebraically closed field.

† Institute of Mathematics, Hebrew University, Jerusalem; Mathematics Research Center, University of Wisconsin-Madison and Department of Mathematics, University of Wisconsin-Milwaukee.

Sponsored by the United States Army under Contract No. DAAG29-80-C-0041.

For the problem of simultaneous similarity our procedure works as follows. In step number $i$ we are given an irreducible variety $V$ in

(0.2)  $M_{n,m+1} = \underbrace{M_n \times \dots \times M_n}_{m+1}$

which is invariant under the action of $G_n$. That is, if $V$ contains a tuple $(A_0,\dots,A_m)$ then $V$ contains $\operatorname{orb}(A_0,\dots,A_m)$. Let $\mathbb{C}[V]$ and $\mathbb{C}(V)$ be the ring of polynomials and its field of quotients whose variables are the $(m+1)n^2$ entries of $A_k = (a_{ij}^{(k)})$, $k = 0,\dots,m$, and whose values are restricted to $V$.

A rational function $f \in \mathbb{C}(V)$ is called invariant if $f$ is constant on all orbits lying in $V$. Denote by $\mathbb{C}[V]^G$ and $\mathbb{C}(V)^G$ the subring and subfield of invariant functions in $\mathbb{C}[V]$ and $\mathbb{C}(V)$ respectively. Since $\mathbb{C}(V)^G$ is a subfield of rational functions in the $(m+1)n^2$ variables $(a_{11}^{(0)},\dots,a_{nn}^{(0)},\dots,a_{11}^{(m)},\dots,a_{nn}^{(m)})$ it is known that $\mathbb{C}(V)^G$ is finitely generated. That is, there exist $q$ invariant functions $\chi_1,\dots,\chi_q \in \mathbb{C}(V)^G$ such that any $\chi \in \mathbb{C}(V)^G$ can be expressed as a rational function in $\chi_1,\dots,\chi_q$. See for example Fogarty [1969, p. 69]. Then there exists an invariant (strict) subvariety $W \subset V$ such that the functions $\chi_1,\dots,\chi_q$ are defined at each point of the open set $V-W$ and their values determine the orbit of $(A_0,\dots,A_m)$. The orbits in $V-W$ are characterized by two numbers, their dimensions and their degrees as irreducible algebraic varieties in $M_{n,m+1}$. In the next stage one has to classify orbits on $W$, and the process ends in a finite number of steps. In Section 1 we identify the sets $V-W$ with open algebraic sets $M^0_{n,m+1,r,\rho}$ which are characterized by two integers $r$ and $\rho$. In fact $n^2-r$ is the dimension of any orbit in $M^0_{n,m+1,r,\rho}$ and $\rho$ is another integer which has a relatively simple characterization. Roughly speaking, $\rho$ is the number of linearly independent polynomial equations which determine the orbit of $(A_0,\dots,A_m)$. We show explicitly how to find a set of invariant functions $\phi_1,\dots,\phi_s$ in $\mathbb{C}(M^0_{n,m+1,r,\rho})^G$ such that any orbit in $M^0_{n,m+1,r,\rho}$ is determined by the values of these functions up to a finite number $\kappa(n,m,r)$ of them.

To understand the complexity of the general problem we give a complete classification of pairs of $2 \times 2$ matrices $(A,B)$ under the simultaneous similarity $(TAT^{-1}, TBT^{-1})$. We do it directly without using the results of Section 1. Our classification then is reduced to three steps. In the first step the coefficients of the characteristic polynomial $|\lambda I - (A+xB)|$ determine the orbit of $(A,B)$ completely as long as $|\lambda I - (A+xB)|$ is irreducible over the ring $\mathbb{C}[\lambda,x]$. Let $U$ be the set of all pairs $(A,B)$ such that $|\lambda I - (A+xB)|$ splits to a product of two linear factors. Then $U$ is given by a single equation

$U = \{(A,B),\ \{2\operatorname{tr}(A^2) - [\operatorname{tr}(A)]^2\}\{2\operatorname{tr}(B^2) - [\operatorname{tr}(B)]^2\} = [2\operatorname{tr}(AB) - \operatorname{tr}(A)\operatorname{tr}(B)]^2\}.$

It follows that $U$ is an irreducible variety of codimension 1. Let $V$ be the set of all pairs $(A,B)$ such that $|\lambda I - (A+xB)|$ is a square of a linear factor. That is, for any $x$, $A+xB$ has a double eigenvalue. Then $V$ is given by a set of three equations

$V = \{(A,B),\ 2\operatorname{tr}(A^2) = [\operatorname{tr}(A)]^2,\ 2\operatorname{tr}(B^2) = [\operatorname{tr}(B)]^2,\ 2\operatorname{tr}(AB) = \operatorname{tr}(A)\operatorname{tr}(B)\}.$
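
The defining equations of $U$ and $V$ are easy to test on concrete $2 \times 2$ pairs. The sketch below is only an illustration; the function names (`in_U`, `in_V`, `disc_terms`) and the sample matrices are chosen here and are not part of the report. The three quantities computed are the coefficients of the $\lambda$-discriminant of $|\lambda I - (A+xB)|$ viewed as a quadratic in $x$, which is a perfect square exactly when the equation of $U$ holds.

```python
def mul(X, Y):
    """Product of two 2x2 integer matrices given as nested lists."""
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def tr(M):
    return M[0][0] + M[1][1]

def disc_terms(A, B):
    """The three trace expressions appearing in the equations of U and V."""
    a = 2 * tr(mul(A, A)) - tr(A) ** 2
    b = 2 * tr(mul(B, B)) - tr(B) ** 2
    c = 2 * tr(mul(A, B)) - tr(A) * tr(B)
    return a, b, c

def in_U(A, B):
    a, b, c = disc_terms(A, B)
    return a * b == c * c

def in_V(A, B):
    return disc_terms(A, B) == (0, 0, 0)

# Hypothetical examples.
tri = ([[1, 2], [0, 3]], [[4, 5], [0, 7]])     # upper triangular pair
gen = ([[0, 1], [1, 0]], [[1, 0], [0, -1]])    # |lambda I - (A+xB)| = lambda^2 - 1 - x^2
dbl = ([[2, 1], [0, 2]], [[5, 3], [0, 5]])     # A + xB always has a double eigenvalue

print(in_U(*tri), in_V(*tri))
print(in_U(*gen), in_V(*gen))
print(in_U(*dbl), in_V(*dbl))
```

The triangular pair lies in $U$ but not in $V$, the second pair lies in neither, and the third lies in $V$ (and hence in $U$).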

So $V$ is an irreducible variety of codimension 3. We show that on $U-V$ the characteristic polynomial $|\lambda I - (A+xB)|$ determines 3 distinct orbits. In order to distinguish between these 3 orbits we have to introduce rational invariant functions in $U$:


$\alpha(A,B) = \dfrac{(b_{11}-b_{22})a_{12}a_{21} + a_{22}a_{12}b_{21} - a_{11}a_{21}b_{12}}{a_{12}b_{21} - a_{21}b_{12}},$

$\beta(A,B) = \dfrac{(a_{11}-a_{22})b_{12}b_{21} + b_{22}b_{12}a_{21} - b_{11}b_{21}a_{12}}{b_{12}a_{21} - b_{21}a_{12}}.$
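
A small numerical experiment illustrates how $\alpha$ and $\beta$ behave. The sketch below is an illustration under stated assumptions: the function names and sample matrices are hypothetical, and the formulas are the ones displayed above. The pairs $(A,B)$ and $(C,D)$ both have characteristic polynomial $(\lambda - 1 - 4x)(\lambda - 3 - 7x)$, and $(A_2,B_2)$ is obtained from $(A,B)$ by a simultaneous similarity.

```python
from fractions import Fraction

def alpha(A, B):
    """The rational function alpha(A, B) for 2x2 matrices (nested lists)."""
    (a11, a12), (a21, a22) = A
    (b11, b12), (b21, b22) = B
    num = (b11 - b22) * a12 * a21 + a22 * a12 * b21 - a11 * a21 * b12
    den = a12 * b21 - a21 * b12
    # Raises ZeroDivisionError where the denominator vanishes.
    return Fraction(num, den)

def beta(A, B):
    # beta is alpha with the roles of A and B exchanged.
    return alpha(B, A)

def conjugate(T, M):
    """T M T^{-1} for a 2x2 integer matrix T with det T = 1."""
    (a, b), (c, d) = T
    assert a * d - b * c == 1
    Tinv = [[d, -b], [-c, a]]
    mul = lambda X, Y: [[sum(X[i][k] * Y[k][j] for k in range(2))
                         for j in range(2)] for i in range(2)]
    return mul(mul(T, M), Tinv)

# Hypothetical sample pairs with the same characteristic polynomial.
A, B = [[-1, 2], [-4, 5]], [[-1, 5], [-8, 12]]
C, D = [[1, 2], [0, 3]], [[2, 5], [-2, 9]]

T = [[2, 1], [1, 1]]
A2, B2 = conjugate(T, A), conjugate(T, B)

print(alpha(A, B), beta(A, B))      # values on (A, B)
print(alpha(A2, B2), beta(A2, B2))  # unchanged after simultaneous similarity
print(alpha(C, D), beta(C, D))      # different values on (C, D)
```

In this example $\alpha$ and $\beta$ take the values 1 and 4 on $(A,B)$ and on its conjugate, and 3 and 7 on $(C,D)$, so the two pairs are not simultaneously similar even though their characteristic polynomials agree.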


Those functions are defined on non-commuting pairs $(A,B)$ in $U$. The orbit of a commuting pair $(A,B) \notin V$ is determined by its characteristic polynomial. It is easy to see that $\mathbb{C}[V]^G$ is generated by $\operatorname{tr}(A)$ and $\operatorname{tr}(B)$. So the transcendence degree of the quotient field of $\mathbb{C}[V]^G$ is 2. However, the transcendence degree of $\mathbb{C}(V)^G$ is 3 since we have an additional rational invariant function on $V$: $\gamma(A,B) = a_{12}/b_{12} = a_{21}/b_{21}$. Then $\operatorname{tr}(A)$, $\operatorname{tr}(B)$ and $\gamma(A,B)$ (or $\gamma(A,B)^{-1}$) determine the orbit of $(A,B)$ as long as $A$ and $B$ are not both scalar matrices. In the last case $\operatorname{tr}(A)$ and $\operatorname{tr}(B)$ determine $\operatorname{orb}(A,B)$. This shows that in the classification process carried out in Section 1 we must at a certain stage consider rational invariant functions in $V$. We now list briefly the contents of the rest of the paper. Sections 3-5 are technical and needed in the sequel. They deal with polynomial maps $p: \mathbb{C}^{\mu} \to \mathbb{C}^{\nu}$ and some special properties of certain algebraic functions in two variables. In Section 6 we deal with irreducible pairs (pencils) $(A,B)$. These are the pairs for which $|\lambda I - (A+xB)|$ is an irreducible polynomial over $\mathbb{C}[\lambda,x]$. The orbits of these pairs are determined completely by the generators of the ring of the invariant polynomials on $M_n \times M_n$. According to Procesi [1976] these generators can be picked to be of the form $\operatorname{tr}(A^{i_1}B^{j_1} \cdots A^{i_p}B^{j_p})$, and we describe a procedure to determine the orbit. We show that the transcendence degree of $\mathbb{C}[M_n \times M_n]^G$ is $n(n+1)$; however we were not able to find a transcendence basis in $\mathbb{C}[M_n \times M_n]^G$.

The rest of the paper is devoted to the study of $(m+1)$-tuples $(A_0,\dots,A_m)$ of symmetric matrices under the action of the complex orthogonal group $O_n$. In this context

$\operatorname{sorb}(A_0,\dots,A_m) = \{(B_0,\dots,B_m),\ B_i = QA_iQ^t,\ i = 0,\dots,m,\ Q \in O_n\}.$

This case is of importance in mathematical physics at least for $m=1$. In that case we can interpret $A_0$ and $A_1$ as the potential and the kinetic energies. The orbit of $(A_0,A_1)$ will correspond to the representations of the potential and the kinetic energies in different orthogonal bases. Usually $A_0$ and $A_1$ are real, however in some instances one considers the complex case too. See for example Moiseyev-Friedland [1980]. Let $S_n$ be the set of $n \times n$ complex symmetric matrices and put

$S_{n,m+1} = \underbrace{S_n \times \dots \times S_n}_{m+1}.$

Denote by $\mathbb{C}[S_{n,m+1}]^{O}$ the invariant polynomials in the entries of $(A_0,\dots,A_m) \in S_{n,m+1}$ under the action of $O_n$. In Section 7 we show that the transcendence degree of $\mathbb{C}[S_{n,m+1}]^{O}$ is $n[(n+1)m+2]/2$ for $m=1$. The general case is discussed in Section 11. We also find a simple transcendence basis in this ring. Unfortunately, this basis is not symmetric in $A_0,\dots,A_m$. Let $A(x) = \sum_{i=0}^{m} A_i x^i$ and consider $p(\lambda,x) = |\lambda I - A(x)|$, the characteristic polynomial of $A(x)$. Clearly, the coefficients of $p(\lambda,x)$ belong to $\mathbb{C}[S_{n,m+1}]^{O}$. The number of these coefficients is $n[(n+1)m+2]/2$! This suggests (as in the $2 \times 2$ case) that these coefficients form a transcendence basis in $\mathbb{C}[S_{n,m+1}]^{O}$. This is indeed the case and Sections 8-11 are devoted to proving this result.
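
The count $n[(n+1)m+2]/2$ can be recovered by a direct tally, sketched below under one stated assumption: the coefficient of $\lambda^{n-k}$ in $p(\lambda,x)$ is a polynomial of degree at most $mk$ in $x$, so it contributes $mk+1$ coefficients for $k = 1,\dots,n$ (the leading coefficient of $\lambda^n$ is the constant 1 and is not counted). The function names are hypothetical.

```python
def count_coefficients(n, m):
    """Tally the non-trivial coefficients of p(lambda, x) = |lambda*I - A(x)|.

    Assumes the coefficient of lambda^(n-k) has degree at most m*k in x,
    hence m*k + 1 coefficients, for k = 1, ..., n.
    """
    return sum(m * k + 1 for k in range(1, n + 1))

def closed_form(n, m):
    """The closed form n[(n+1)m + 2]/2 quoted in the text."""
    return n * ((n + 1) * m + 2) // 2

# The two counts agree on a small grid of (n, m).
print(all(count_coefficients(n, m) == closed_form(n, m)
          for n in range(1, 8) for m in range(1, 6)))
```

For $m = 1$ and $n = 2$ this gives 5 coefficients: $\operatorname{tr}A$, $\operatorname{tr}B$, $\det A$, $\det B$ and the mixed coefficient of $x$ in $\det(A+xB)$.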

We also determine a class of characteristic polynomials $p(\lambda,x)$ which determine the orbit of $(A_0,\dots,A_m)$ up to a finite number, and we estimate from above the number of distinct orbits corresponding to $p(\lambda,x)$. In Section 8 we study polynomial matrices $A(x)$ with constant eigenvalues. In Section 8 we show that if $A,B \in S_n$ and $A+xB$ has $n$ constant distinct eigenvalues then $B = 0$ for $n \le 4$. This result implies that for $n \le 4$ the polynomial $p(\lambda,x) = |\lambda I - (A+xB)|$ determines at most

$M(n) = \prod_{i=1}^{n} i! \,/\, 2^{n-1}$

distinct orbits of $(A,B)$ (and generally this is the number of distinct orbits) on condition that $p(\lambda,x)$ is non-degenerate. That is, $p(\lambda,x) = 0$ does not have a multiple root for all $x$.
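
For orientation, the bound $M(n)$ can be tabulated for the range $n \le 4$ in which the text applies it. This is a minimal sketch that reads the displayed formula as $(\prod_{i=1}^{n} i!)/2^{n-1}$; the function name is hypothetical.

```python
from fractions import Fraction
from math import factorial

def orbit_bound(n):
    """M(n) = (1! * 2! * ... * n!) / 2^(n-1), kept as an exact rational."""
    product = 1
    for i in range(1, n + 1):
        product *= factorial(i)
    return Fraction(product, 2 ** (n - 1))

for n in range(1, 5):
    print(n, orbit_bound(n))
```

Under this reading $M(2) = 1$, which matches the $2 \times 2$ picture in which the characteristic polynomial of a generic symmetric pair determines its orbit.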

For $n \ge 5$ this result does not apply. Section 10 deals with a general symmetric polynomial matrix $A(x)$. We show that if $p(\lambda,x) = |\lambda I - A(x)|$ is a non-degenerate polynomial and for each finite or infinite $x$ each $\lambda$ root of $p(\lambda,x) = 0$ is either single or double then there are at most 2(n-1)(mn-1) distinct orbits corresponding to $(m+1)$-tuples of symmetric matrices $(A_0,\dots,A_m)$ such that

$p(\lambda,x) = \left|\lambda I - \sum_{i=0}^{m} A_i x^i\right|.$

The proof of this result is non-trivial and lengthy. The basic idea is to use the theory of one complex variable, in particular the Liouville theorem. The last section is devoted to various remarks, comments and conjectures.

1.  Classification  of  similarity  classes  of  tuples  of  matrices.

Let $A \in M_n$. As usual denote by $|A|$ the determinant of $A$. In what follows we adopt the notation of Marcus-Minc [1964]. Denote by $Q_{k,n}$ the set of strictly increasing sequences of integers $\alpha = (\alpha_1,\dots,\alpha_k)$, $1 \le \alpha_1 < \dots < \alpha_k \le n$. For any rectangular matrix $A$ we denote by $A[\alpha|\beta]$ the submatrix generated by the rows $\alpha = (\alpha_1,\dots,\alpha_k)$ and columns $\beta = (\beta_1,\dots,\beta_l)$. In case that $k = l$, $|A[\alpha|\beta]|$ denotes the appropriate minor of $A$. Also $A[i,j]$ denotes the $(i,j)$ entry of $A$. For $A,B \in M_n$ let $L(B,A)$ be the following operator on $M_n$:

$L(B,A)X = BX - XA, \quad X \in M_n.$

In tensor notation $L(B,A)$ is represented by the matrix $I \otimes B - A^t \otimes I$.

Here by $A^t$ we denote the transposed matrix of $A$. See for example Marcus-Minc [1964, p. 8]. Let $\nu(B_0,\dots,B_m,A_0,\dots,A_m)$ be the dimension of the subspace of matrices $X$ satisfying

(1.1)  $B_iX - XA_i = 0, \quad i = 0,\dots,m.$

For $B_i = A_i$, $i = 0,\dots,m$, denote the dimension of this subspace by $\nu(A_0,\dots,A_m)$. Note that $\nu(A_0,\dots,A_m) \ge 1$ since $X = I$ is a solution of (1.1) for $B_i = A_i$, $i = 0,\dots,m$. Denote by $L(B_0,\dots,B_m,A_0,\dots,A_m)$ the $(m+1)n^2 \times n^2$ matrix composed of the submatrices $L(B_i,A_i)$, $i = 0,\dots,m$. Clearly

(1.2)  $\operatorname{rank} L(B_0,\dots,B_m,A_0,\dots,A_m) = n^2 - \nu(B_0,\dots,B_m,A_0,\dots,A_m).$
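
The matrix $L(B_0,\dots,B_m,A_0,\dots,A_m)$ and the number $\nu$ can be computed mechanically. The following is a minimal sketch, assuming the column-stacking convention for $X$ (under which $BX - XA$ corresponds to $(I \otimes B - A^t \otimes I)\,\mathrm{vec}(X)$); the function names `kron`, `L`, `rank` and `nu` and the sample matrices are choices made here for illustration. Exact rational arithmetic avoids any rank ambiguity from rounding.

```python
from fractions import Fraction

def kron(X, Y):
    """Kronecker product of two matrices given as nested lists."""
    return [[x * y for x in row_x for y in row_y] for row_x in X for row_y in Y]

def L(B, A):
    """Matrix of X -> BX - XA on column-stacked X: I (x) B - A^t (x) I."""
    n = len(A)
    eye = [[int(i == j) for j in range(n)] for i in range(n)]
    At = [list(col) for col in zip(*A)]
    left, right = kron(eye, B), kron(At, eye)
    return [[p - q for p, q in zip(r1, r2)] for r1, r2 in zip(left, right)]

def rank(M):
    """Rank by exact Gaussian elimination over the rationals."""
    M = [[Fraction(v) for v in row] for row in M]
    r = 0
    for c in range(len(M[0])):
        piv = next((i for i in range(r, len(M)) if M[i][c] != 0), None)
        if piv is None:
            continue
        M[r], M[piv] = M[piv], M[r]
        for i in range(r + 1, len(M)):
            f = M[i][c] / M[r][c]
            M[i] = [a - f * b for a, b in zip(M[i], M[r])]
        r += 1
        if r == len(M):
            break
    return r

def nu(Bs, As):
    """Dimension of {X : B_i X = X A_i for all i}, computed through (1.2)."""
    n = len(As[0])
    stacked = [row for B, A in zip(Bs, As) for row in L(B, A)]
    return n * n - rank(stacked)

# Hypothetical sample tuple: only scalar matrices commute with both A0 and A1.
A0, A1 = [[1, 0], [0, 2]], [[0, 1], [1, 0]]
print(nu([A0, A1], [A0, A1]))   # dimension of the common commutant
print(nu([A0], [A0]))           # commutant of A0 alone (diagonal matrices)
```

For this tuple $\nu(A_0,A_1) = 1$, so $r = n^2 - \nu = 3$, while $\nu(A_0) = 2$.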

Let $A,B \in M_n$. Suppose that $B = TAT^{-1}$. Then $L(A,X)$ and $L(X,A)$ are correspondingly similar to $L(B,X)$ and $L(X,B)$ for any $X \in M_n$:

(1.3)  $I \otimes B - X^t \otimes I = (I \otimes T)(I \otimes A - X^t \otimes I)(I \otimes T^{-1}),$
       $I \otimes X - B^t \otimes I = ((T^t)^{-1} \otimes I)(I \otimes X - A^t \otimes I)(T^t \otimes I).$

So if $T(A_0,\dots,A_m)T^{-1} = (B_0,\dots,B_m)$ we have

(1.4)  $L(B_0,\dots,B_m,X_0,\dots,X_m) = \operatorname{diag}\{I \otimes T,\dots,I \otimes T\}\, L(A_0,\dots,A_m,X_0,\dots,X_m)\,(I \otimes T^{-1}),$
       $L(X_0,\dots,X_m,B_0,\dots,B_m) = \operatorname{diag}\{(T^t)^{-1} \otimes I,\dots,(T^t)^{-1} \otimes I\}\, L(X_0,\dots,X_m,A_0,\dots,A_m)\,(T^t \otimes I).$

Here by $\operatorname{diag}\{A_0,\dots,A_m\}$ we mean the block diagonal matrix with matrices $A_0,\dots,A_m$ on the main diagonal. By choosing $X_i = A_i$ in the first equality in (1.4) we get

(1.5)  $\nu(B_0,\dots,B_m,A_0,\dots,A_m) = \nu(A_0,\dots,A_m)$

if $(A_0,\dots,A_m)$ and $(B_0,\dots,B_m)$ are simultaneously similar. In general, the equality (1.5) does not imply the similarity of $(A_0,\dots,A_m)$ and $(B_0,\dots,B_m)$ even in the case $m=0$. See Friedland [1980]. However (1.5) implies the similarity of $(B_0,\dots,B_m)$ and $(A_0,\dots,A_m)$ provided that $(B_0,\dots,B_m)$ lies in some open set $U$ containing $(A_0,\dots,A_m)$.
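
The equality (1.5) can be observed on a small example. The sketch below is illustrative only: it repeats hypothetical helpers (`kron`, `L`, `rank`, `nu`, `matmul`) so that it runs on its own, assumes the column-stacking convention for $X$, and uses sample matrices chosen here. It compares $\nu(B_0,B_1,A_0,A_1)$ for a simultaneously similar tuple with the value for a tuple whose first matrix has different eigenvalues.

```python
from fractions import Fraction

def kron(X, Y):
    return [[x * y for x in rx for y in ry] for rx in X for ry in Y]

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def L(B, A):
    """Matrix of X -> BX - XA on column-stacked X: I (x) B - A^t (x) I."""
    n = len(A)
    eye = [[int(i == j) for j in range(n)] for i in range(n)]
    At = [list(col) for col in zip(*A)]
    return [[p - q for p, q in zip(r1, r2)]
            for r1, r2 in zip(kron(eye, B), kron(At, eye))]

def rank(M):
    """Rank by exact Gaussian elimination over the rationals."""
    M = [[Fraction(v) for v in row] for row in M]
    r = 0
    for c in range(len(M[0])):
        piv = next((i for i in range(r, len(M)) if M[i][c] != 0), None)
        if piv is None:
            continue
        M[r], M[piv] = M[piv], M[r]
        for i in range(r + 1, len(M)):
            f = M[i][c] / M[r][c]
            M[i] = [a - f * b for a, b in zip(M[i], M[r])]
        r += 1
        if r == len(M):
            break
    return r

def nu(Bs, As):
    """Dimension of {X : B_i X = X A_i for all i}."""
    n = len(As[0])
    return n * n - rank([row for B, A in zip(Bs, As) for row in L(B, A)])

# Hypothetical data: a tuple (A0, A1), a change of basis T, and B_i = T A_i T^{-1}.
A0, A1 = [[1, 0], [0, 2]], [[0, 1], [1, 0]]
T, Tinv = [[1, 1], [0, 1]], [[1, -1], [0, 1]]
B0 = matmul(matmul(T, A0), Tinv)
B1 = matmul(matmul(T, A1), Tinv)

# A tuple that is not similar to (A0, A1): C0 has different eigenvalues.
C0, C1 = [[5, 0], [0, 6]], A1

print(nu([A0, A1], [A0, A1]), nu([B0, B1], [A0, A1]), nu([C0, C1], [A0, A1]))
```

Here the first two values coincide, as (1.5) predicts, while the third is 0 because no nonzero $X$ satisfies $C_0X = XA_0$. The example does not touch the subtler point made in the text, that (1.5) alone is not sufficient for similarity.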

Theorem 1.6. Let $A_0,\dots,A_m \in M_n$ and put $\nu = \nu(A_0,\dots,A_m)$, $r = n^2 - \nu$. Assume that $\alpha \in Q_{r,(m+1)n^2}$, $\beta \in Q_{r,n^2}$ satisfy the following assumptions:

(i) there exist $B_0,\dots,B_m \in M_n$ such that

(1.7)  $|L(B_0,\dots,B_m,A_0,\dots,A_m)[\alpha|\beta]| \ne 0.$

(ii) Identify $\beta$ with a subset of $W \times W$, $W = \{1,\dots,n\}$. In the system (1.1) consider the $r$ equations given by the index set $\alpha$. Let $X(\xi) = (x_{ij})$ be the unique solution of these $r$ equations satisfying the conditions

$x_{ij} = \xi_{\lambda(i,j)}, \quad (i,j) \notin \beta.$

Assume that $|X(\xi)|$ does not vanish identically on $\mathbb{C}^{\nu}$.

Denote by $U_{\alpha,\beta}$ the set of all $(B_0,\dots,B_m)$ which fulfill (i) and (ii). Let $U$ be the union of all $U_{\alpha,\beta}$, $\alpha \in Q_{r,(m+1)n^2}$, $\beta \in Q_{r,n^2}$, where $\alpha$ and $\beta$ satisfy (i) and (ii). Then $U$ is a non-empty open (algebraic) set in $M_{n,m+1}$ which contains $(A_0,\dots,A_m)$. Moreover $(B_0,\dots,B_m)$ is simultaneously similar to $(A_0,\dots,A_m)$ if and only if $(B_0,\dots,B_m) \in U$ and the following equalities hold:

(1.8)  $|L(B_0,\dots,B_m,A_0,\dots,A_m)[\gamma|\delta]| = 0, \quad \gamma \in Q_{r+1,(m+1)n^2},\ \delta \in Q_{r+1,n^2}.$
Proof. Let $\alpha$ and $\beta$ satisfy the assumptions (i) and (ii). Since the coefficients of the variables $\xi$ in the polynomial $|X(\xi)|$ are rational functions in the entries of $(A_0,\dots,A_m)$ and $(B_0,\dots,B_m)$ it follows that $U_{\alpha,\beta}$ is an open algebraic set. That is, $U_{\alpha,\beta}$ is a union of a finite number of sets, each of which is characterized by some non-vanishing polynomial. Suppose that $(B_0,\dots,B_m)$ is simultaneously similar to $(A_0,\dots,A_m)$. Then the matrices $L(A_0,\dots,A_m,A_0,\dots,A_m)$ and $L(B_0,\dots,B_m,A_0,\dots,A_m)$ are equivalent. Therefore the above matrices have the same rank. So there exist $\alpha \in Q_{r,(m+1)n^2}$ and $\beta \in Q_{r,n^2}$ which satisfy the assumption (i). Furthermore, as $B_i = TA_iT^{-1}$, $i = 0,\dots,m$, the polynomial $|X(\xi)|$ does not vanish identically. This shows that $U_{\alpha,\beta}$ is a non-empty open algebraic set, and hence $U$ is a non-empty open algebraic set. Moreover if $(B_0,\dots,B_m)$ is simultaneously similar to $(A_0,\dots,A_m)$ then $(B_0,\dots,B_m) \in U$. In particular $(A_0,\dots,A_m) \in U$. Assume that $(B_0,\dots,B_m) \in U$. Then there exists a nonsingular matrix $X$ which satisfies $r$ independent equations of (1.1). The equality (1.8) implies that all other equations of the system (1.1) are linear combinations of these $r$ equations. So $X$ satisfies (1.1), which means that $(B_0,\dots,B_m)$ is simultaneously similar to $(A_0,\dots,A_m)$. ∎

Let $\nu$ and $r$ be defined as in Theorem 1.6. Consider the following algebraic variety $X$ in $M_{n,m+1}$ defined by the equations

(1.9)  $|L(X_0,\dots,X_m,A_0,\dots,A_m)[\alpha|\beta]| = 0, \quad \alpha \in Q_{r+1,(m+1)n^2},\ \beta \in Q_{r+1,n^2}.$

This variety splits to $\kappa$ irreducible varieties

(1.10)  $X = \bigcup_{i=1}^{\kappa} X_i.$

See Section 3 for various properties of algebraic varieties needed here and the appropriate references.

In Section 3 we show that $\kappa$ is bounded:

(1.11)  $\kappa \le r^{\nu(mn^2+\nu)}, \quad \nu = n^2 - r.$

Let $U$ be an open set defined in Theorem 1.6. According to Theorem 1.6

$X \cap U \subset \operatorname{orb}(A_0,\dots,A_m).$

Since the orbit of $(A_0,\dots,A_m)$ is a manifold of dimension $r$ it follows that the point $(A_0,\dots,A_m)$ is contained in exactly one irreducible variety, say $X_1$. Also the dimension of $X_1$ is $r$.

Definition 1.12. Let $M_{n,m+1,r}$ be the set of all matrices $(A_0,\dots,A_m) \in M_{n,m+1}$ such that

$|L(A_0,\dots,A_m,A_0,\dots,A_m)[\alpha|\beta]| = 0, \quad \alpha \in Q_{r+1,(m+1)n^2},\ \beta \in Q_{r+1,n^2}.$

By $M^0_{n,m+1,r}$ denote the open (algebraic) subset of $M_{n,m+1,r}$ of $(A_0,\dots,A_m)$ satisfying

$\nu(A_0,\dots,A_m) = n^2 - r.$

(It may happen that $M^0_{n,m+1,r}$ is empty.) Let $\kappa(n,m,r)$ be the maximal number of irreducible varieties $X_i$ of dimension $r$ in the decomposition (1.10) of the variety $X$ given by (1.9) having a non-empty intersection with $M^0_{n,m+1,r}$, for all possible choices of $(A_0,\dots,A_m) \in M^0_{n,m+1,r}$ ($\kappa(n,m,r) = 0$ if $M^0_{n,m+1,r}$ is empty).

So (1.11) implies

(1.13)  $\kappa(n,m,r) \le r^{\nu(mn^2+\nu)}, \quad \nu = n^2 - r.$

Theorem 1.14. Let $(A_0,\dots,A_m) \in M^0_{n,m+1,r}$. Let $X_1$ be the irreducible component of the variety (1.9) containing the point $(A_0,\dots,A_m)$. Then $\operatorname{orb}(A_0,\dots,A_m)$ is an open algebraic set in $X_1$. That is

(1.15)  $\overline{\operatorname{orb}(A_0,\dots,A_m)} = X_1.$

Moreover

(1.16)  $\operatorname{orb}(A_0,\dots,A_m) = X_1 \cap M^0_{n,m+1,r}.$


Proof. As $\operatorname{orb}(A_0,\dots,A_m)$ is a manifold of dimension $r$, by Theorem 1.6 we get $\operatorname{orb}(A_0,\dots,A_m) \subset X_1$. Let $U$ be defined as in Theorem 1.6. According to Theorem 1.6

$\operatorname{orb}(A_0,\dots,A_m) = X_1 \cap U.$

As $U$ is an open algebraic set in $M_{n,m+1}$ we get that $\operatorname{orb}(A_0,\dots,A_m)$ is an open algebraic set in the irreducible variety $X_1$, so (1.15) holds. (Here $\overline{\operatorname{orb}(A_0,\dots,A_m)}$ means the closure of the orbit set of $(A_0,\dots,A_m)$.) Let $(B_0,\dots,B_m) \in X_1 \cap M^0_{n,m+1,r}$. So any neighborhood of $(B_0,\dots,B_m)$ contains a point $(C_0,\dots,C_m) \in \operatorname{orb}(A_0,\dots,A_m)$. According to the equalities (1.3) the matrices $L(B_0,\dots,B_m,A_0,\dots,A_m)$ and $L(B_0,\dots,B_m,C_0,\dots,C_m)$ are equivalent. As $(B_0,\dots,B_m) \in X_1$ we deduce

$|L(B_0,\dots,B_m,C_0,\dots,C_m)[\alpha|\beta]| = 0, \quad \alpha \in Q_{r+1,(m+1)n^2},\ \beta \in Q_{r+1,n^2}.$

A result analogous to Theorem 1.6 yields that there exists a neighborhood $\hat{U}$ of $(B_0,\dots,B_m)$ such that $(C_0,\dots,C_m) \in \operatorname{orb}(B_0,\dots,B_m)$ provided that $(C_0,\dots,C_m) \in \hat{U}$ and the above equalities hold. So there exists $(C_0,\dots,C_m)$ which lies in the orbits generated by $(A_0,\dots,A_m)$ and $(B_0,\dots,B_m)$. Whence $(B_0,\dots,B_m) \in \operatorname{orb}(A_0,\dots,A_m)$. As $\operatorname{orb}(A_0,\dots,A_m) \subset M^0_{n,m+1,r}$ we finally deduce the equality (1.16). ∎

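Consider the left hand side of the equalities (1.9). These are multilinear polynomials in the entries of $X_0,\dots,X_m$. Let $W_{n,m+1}$ be the linear space of all multilinear polynomials in the entries of $X_0,\dots,X_m$. We formally define $W_{n,m+1}$ as follows. Consider the set of all $n \times n$ matrices $\gamma = (\gamma_{ij})$ with (0,1) entries, i.e. $\gamma_{ij} = 0,1$. For $X = (x_{ij}) \in M_n$ we let

$X^{\gamma} = \prod_{1 \le i,j \le n} x_{ij}^{\gamma_{ij}} \quad (x_{ij}^{0} = 1).$

Then $W_{n,m+1}$ consists of all polynomials

(1.17)  $\sum_{\gamma_0,\dots,\gamma_m} a_{\gamma_0 \cdots \gamma_m}\, X_0^{\gamma_0} \cdots X_m^{\gamma_m},$

where the sum runs over all $(m+1)$-tuples of (0,1) matrices $\gamma_0,\dots,\gamma_m$. Clearly

(1.18)  $\dim W_{n,m+1} = 2^{(m+1)n^2}.$

With this notation each minor in (1.9) is an element of $W_{n,m+1}$,

(1.19)  $|L(X_0,\dots,X_m,A_0,\dots,A_m)[\alpha|\beta]|, \quad \alpha \in Q_{r+1,(m+1)n^2},\ \beta \in Q_{r+1,n^2},$

whose coefficients in the expansion (1.17) depend on $(A_0,\dots,A_m)$. Denote by $P(A_0,\dots,A_m)$ the matrix with $\binom{(m+1)n^2}{r+1}\binom{n^2}{r+1}$ rows whose rows are the vectors of these coefficients.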

In fact the matrix $P(A_0,\dots,A_m)$ contains a lot of zero columns. Indeed if we pick a minor of the form (1.19) then each entry of such a minor is a sum of one element of some $X_i$ and one element of some $A_j$. Therefore the polynomial

(1.19)  contains  exactly  (r+1)l  moninomials  of  degree  r+1  of  the  form 

in  the  entries  of  ^o'***'^m*  number  of  distinct 

mononomials  of  degree  r+1  appearing  in  all  possible  expression  in  (1.19)  is 

2  2 

exactly  (r+1)l[  J(  The  number  of  distinct  monomials  of  degree 

2  2 

d  <  r  which  may  appear  in  (1.19)  is  )(^  )•  Hence  the  upper  bound 


for  nontrivial  columns  appearing  in  P(A,,,*»»,A  )  is 

U  m 


(1.20)  N  =  T  d! 

n,m,r  '■  d  -''■d  '  '■  r+1  ■''■r+1-' 

d=o 


2  2 


2  2 


By $M_{n,m+1,r,\rho}$ denote the subset of all tuples $(A_0,\dots,A_m) \in M_{n,m+1,r}$ such that $\operatorname{rank} P(A_0,\dots,A_m) \le \rho$.

Clearly, $M_{n,m+1,r,\rho}$ is an algebraic set. Let $M^0_{n,m+1,r,\rho}$ be the subset of all tuples $(A_0,\dots,A_m) \in M^0_{n,m+1,r}$ satisfying

(1.21)  $\operatorname{rank} P(A_0,\dots,A_m) = \rho.$

Again, it is easy to see that $M^0_{n,m+1,r,\rho}$ is an open (algebraic) set in $M_{n,m+1,r,\rho}$. We now give sets of invariant rational functions which determine uniquely a finite number of orbits lying in $M^0_{n,m+1,r,\rho}$. To do that we need to recall the notion of the Hermite normal form of a rectangular matrix $A$. See for example Marcus-Minc [1964]. Two $p \times q$ rectangular matrices $A$ and $B$ are called row equivalent ($A \sim B$) if there exists a nonsingular matrix $Q$ such that $B = QA$. Any $p \times q$ matrix $A$ can be brought to the unique Hermite normal form $E = E(A)$ using the elementary row operations. $E = (e_{ij})$ is characterized by the conditions

(1.22)  $1 \le p_1 < p_2 < \dots < p_\rho \le q, \quad \rho = \operatorname{rank} A,$
        $e_{ip_i} = 1, \quad e_{ij} = 0$ for $j < p_i$ and for $j = p_{i+1},\dots,p_\rho, \quad i = 1,\dots,\rho,$
        $e_{ij} = 0$ for $i > \rho.$

The integers $p_1,\dots,p_\rho$ are called the discrete invariants of $A$ and the entries $e_{ij}$, $p_i < j$, $j \ne p_{i+1},\dots,p_\rho$, $i = 1,\dots,\rho$, are called the continuous invariants of $A$. Given $p_1,\dots,p_\rho$, the continuous invariants are well determined rational functions of $A$.
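
The Hermite normal form and its two kinds of invariants can be computed by ordinary Gauss-Jordan elimination. The sketch below is an illustration, not the report's procedure: the function name `hermite_normal_form` and the sample matrices are assumptions made here, and exact rationals are used so that row equivalent matrices give literally identical output.

```python
from fractions import Fraction

def hermite_normal_form(A):
    """Return (E, pivots): the reduced row echelon form of A and the
    1-based pivot columns p_1 < ... < p_rho (the discrete invariants)."""
    E = [[Fraction(v) for v in row] for row in A]
    pivots = []
    r = 0
    for c in range(len(E[0])):
        piv = next((i for i in range(r, len(E)) if E[i][c] != 0), None)
        if piv is None:
            continue
        E[r], E[piv] = E[piv], E[r]
        lead = E[r][c]
        E[r] = [v / lead for v in E[r]]          # make the pivot entry 1
        for i in range(len(E)):
            if i != r and E[i][c] != 0:          # clear the rest of the column
                f = E[i][c]
                E[i] = [a - f * b for a, b in zip(E[i], E[r])]
        pivots.append(c + 1)
        r += 1
        if r == len(E):
            break
    return E, pivots

# Hypothetical example: A and QA (Q nonsingular) are row equivalent.
A = [[1, 2, 3], [2, 4, 7]]
QA = [[3, 6, 10], [2, 4, 7]]   # Q = [[1, 1], [0, 1]]

E, pivots = hermite_normal_form(A)
print(pivots)                  # discrete invariants
print(E)                       # entries off the pivot columns are the continuous invariants
print(hermite_normal_form(QA) == (E, pivots))
```

For this $A$ the discrete invariants are $p_1 = 1$, $p_2 = 3$ and the only continuous invariant is $e_{12} = 2$; the row equivalent matrix $QA$ yields the same form.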

Theorem 1.23. Let $r$ and $\rho$ be positive integers such that $M^0_{n,m+1,r,\rho}$ is nonempty. Assume that $(A_0,\dots,A_m), (B_0,\dots,B_m) \in M^0_{n,m+1,r,\rho}$. If $(A_0,\dots,A_m)$ and $(B_0,\dots,B_m)$ are simultaneously similar then

(1.24)  $P(A_0,\dots,A_m) \sim P(B_0,\dots,B_m).$

Moreover, there are at most $\kappa(n,m,r)$ distinct orbits in $M^0_{n,m+1,r,\rho}$ which satisfy the equality (1.24). That is, the discrete and the continuous invariants of $P(A_0,\dots,A_m)$ determine at most $\kappa(n,m,r)$ distinct orbits in $M^0_{n,m+1,r,\rho}$.

Proof. Assume that $(B_0,\dots,B_m) = T(A_0,\dots,A_m)T^{-1}$. Then (1.4) yields that the matrices $L(X_0,\dots,X_m,A_0,\dots,A_m)$ and $L(X_0,\dots,X_m,B_0,\dots,B_m)$ are equivalent. Moreover the transformation matrices do not depend on the matrices $X_0,\dots,X_m$. So any $k \times k$ minor of $L(X_0,\dots,X_m,B_0,\dots,B_m)$ (of $L(X_0,\dots,X_m,A_0,\dots,A_m)$) is a fixed linear combination of all $k \times k$ minors of $L(X_0,\dots,X_m,A_0,\dots,A_m)$ (of $L(X_0,\dots,X_m,B_0,\dots,B_m)$). So the matrices $P(A_0,\dots,A_m)$ and $P(B_0,\dots,B_m)$ can be transformed each one to the other by elementary row operations. That is, the equality (1.24) holds.

Consider the algebraic variety defined by (1.9). In the decomposition (1.10) we may assume that $X_1,\dots,X_\tau$ are all the irreducible varieties which have exactly the dimension $r$ such that $X_i \cap M^0_{n,m+1,r,\rho}$ is an open non-empty set in $X_i$ for $i = 1,\dots,\tau$. By the definition of $\kappa(n,m,r)$, $\tau \le \kappa(n,m,r)$. Suppose that $(A_0^{(i)},\dots,A_m^{(i)}) \in X_i \cap M^0_{n,m+1,r,\rho}$ and assume that $(A_0^{(i)},\dots,A_m^{(i)})$ is a regular point in $X_i$. That is, $X_i$ is a manifold of dimension $r$ in the neighborhood of $(A_0^{(i)},\dots,A_m^{(i)})$. We assume the normalization $A_j^{(1)} = A_j$, $j = 0,\dots,m$. So $X_1 = \overline{\operatorname{orb}(A_0,\dots,A_m)}$. The equalities (1.4) yield that $\operatorname{orb}(A_0^{(i)},\dots,A_m^{(i)})$ satisfies the system (1.9). As $\operatorname{orb}(A_0^{(i)},\dots,A_m^{(i)})$ is a manifold of dimension $r$ which passes through a regular point $(A_0^{(i)},\dots,A_m^{(i)})$ of $X_i$ we deduce that $\operatorname{orb}(A_0^{(i)},\dots,A_m^{(i)}) \subset X_i$. Now Theorem 1.14 implies that $\overline{\operatorname{orb}(A_0^{(i)},\dots,A_m^{(i)})}$ is an irreducible variety. So $\overline{\operatorname{orb}(A_0^{(i)},\dots,A_m^{(i)})} = X_i$. Let $(B_0,\dots,B_m) \in M^0_{n,m+1,r,\rho}$ and assume that (1.24) holds. Thus any $(r+1) \times (r+1)$ minor of $L(X_0,\dots,X_m,A_0,\dots,A_m)$ is a linear combination of all minors of $L(X_0,\dots,X_m,B_0,\dots,B_m)$ and vice versa. Therefore $(X_0,\dots,X_m)$ satisfies the system (1.9) if and only if $(X_0,\dots,X_m)$ satisfies the system

(1.25)  $|L(X_0,\dots,X_m,B_0,\dots,B_m)[\alpha|\beta]| = 0, \quad \alpha \in Q_{r+1,(m+1)n^2},\ \beta \in Q_{r+1,n^2}.$

As $\nu(B_0,\dots,B_m) = n^2 - r$, $(B_0,\dots,B_m)$ is a solution to the above equalities. Whence $(B_0,\dots,B_m)$ satisfies the equations (1.9). Therefore $\operatorname{orb}(B_0,\dots,B_m)$ satisfies the equalities (1.9). Since $(B_0,\dots,B_m) \in M^0_{n,m+1,r,\rho}$ we deduce that $\operatorname{orb}(B_0,\dots,B_m) \subset M^0_{n,m+1,r,\rho}$. So $\overline{\operatorname{orb}(B_0,\dots,B_m)}$ is an irreducible variety of dimension $r$ such that its intersection with $M^0_{n,m+1,r,\rho}$ is a non-empty open set in $\overline{\operatorname{orb}(B_0,\dots,B_m)}$. Hence $\overline{\operatorname{orb}(B_0,\dots,B_m)} = X_i$ for some $1 \le i \le \tau$. As $(B_0,\dots,B_m) \in M^0_{n,m+1,r}$, Theorem 1.14 yields that $(B_0,\dots,B_m)$ is simultaneously similar to $(A_0^{(i)},\dots,A_m^{(i)})$. Thus there are at most $\tau$ ($\le \kappa(n,m,r)$) distinct orbits such that any two points on those orbits satisfy the equalities (1.24). ∎

Let $(A_0,\dots,A_m) \in M_{n,m+1,r,\rho}$. Assume that $\omega = (p_1,\dots,p_\rho)$ is the set of discrete invariants of $P(A_0,\dots,A_m)$. Let $V(\omega)$ be the set of all $(B_0,\dots,B_m) \in M_{n,m+1,r,\rho}$ for which $\omega$ is the set of the discrete invariants of $P(B_0,\dots,B_m)$. Clearly $V(\omega)$ is an open set in $M_{n,m+1,r,\rho}$. Moreover, $M_{n,m+1,r,\rho}$ is a finite union of sets $V(\omega)$, for admissible $\omega \in \Omega_{\rho,\eta}$, $\eta = N_{n,m,r}$. On each $V(\omega)$ the continuous invariants of $P(A_0,\dots,A_m)$, which are rational functions in the entries of $A_0,\dots,A_m$, classify the orbits in $V(\omega)$ up to a finite number not exceeding $\kappa(n,m,r)$.

We  conjecture 

Conjecture 1.26. Let $(A_0,\dots,A_m), (B_0,\dots,B_m) \in M_{n,m+1,r,\rho}$. Assume that the equality (1.24) holds. Then $(A_0,\dots,A_m)$ and $(B_0,\dots,B_m)$ are simultaneously similar.

For $m = 0$ the above conjecture is valid. Indeed, according to Friedland [1980]

(1.27) $\nu(B_0,A_0) \le \tfrac{1}{2}\,(\nu(A_0,A_0) + \nu(B_0,B_0))$,

with the equality sign holding if and only if $A_0$ and $B_0$ are similar.

According to the proof of Theorem 1.23 the equality (1.24) implies that all $(r+1)\times(r+1)$ minors of $L(B_0,A_0)$ vanish. So

$\nu(B_0,A_0) \ge n^2 - r = \nu(A_0,A_0) = \nu(B_0,B_0)$.

Hence we must have the equality sign in (1.27), which means that $A_0$ and $B_0$ are similar. In the next section we verify the above conjecture for m=1 and n=2.

2. The 2×2 case


Let $A, B \in M_2$. Clearly

(2.1) $\phi_1 = \mathrm{tr}(A)$, $\phi_2 = \mathrm{tr}(A^2)$, $\phi_3 = \mathrm{tr}(B)$, $\phi_4 = \mathrm{tr}(B^2)$, $\phi_5 = \mathrm{tr}(AB)$

are invariant polynomials under the simultaneous similarity.
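
The invariance claimed for (2.1) can be checked numerically. The following is a minimal sketch in exact rational arithmetic; the helper names (`mat`, `mul`, `inv`, `invariants`, `conjugate`) and the sample matrices are illustrative assumptions, not part of the paper.

```python
from fractions import Fraction

def mat(rows):
    # Convert integer entries to exact fractions.
    return [[Fraction(x) for x in row] for row in rows]

def mul(a, b):
    # Product of two 2x2 matrices.
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def inv(t):
    # Inverse of an invertible 2x2 matrix.
    det = t[0][0] * t[1][1] - t[0][1] * t[1][0]
    return [[t[1][1] / det, -t[0][1] / det],
            [-t[1][0] / det, t[0][0] / det]]

def tr(a):
    return a[0][0] + a[1][1]

def invariants(a, b):
    # (phi_1, ..., phi_5) of (2.1).
    return (tr(a), tr(mul(a, a)), tr(b), tr(mul(b, b)), tr(mul(a, b)))

def conjugate(t, a):
    # T A T^{-1}
    return mul(mul(t, a), inv(t))

A = mat([[1, 2], [3, 4]])       # hypothetical sample pair
B = mat([[0, 1], [5, -2]])
T = mat([[2, 1], [1, 1]])       # any invertible T will do

before = invariants(A, B)
after = invariants(conjugate(T, A), conjugate(T, B))
print(before == after)
```

The same five numbers are obtained for every choice of invertible `T`, which is all that the word "invariant" asserts here.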

Theorem 2.2. Let $U$ be the following algebraic variety

(2.3) $U = \{(A,B) \mid \{2\mathrm{tr}(A^2)-[\mathrm{tr}(A)]^2\}\{2\mathrm{tr}(B^2)-[\mathrm{tr}(B)]^2\} = [2\mathrm{tr}(AB) - \mathrm{tr}(A)\mathrm{tr}(B)]^2\}$.

Then, for any pair $(A,B)$ not lying in $U$ the orbit of $(A,B)$ is determined uniquely by the values of $\phi_i(A,B)$, $i = 1,\dots,5$.

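Proof. Suppose first that $A$ has distinct eigenvalues $\lambda_1, \lambda_2$, $\lambda_1 \ne \lambda_2$, that is $[\mathrm{tr}(A)]^2 \ne 2\mathrm{tr}(A^2)$. The eigenvalues $\lambda_1$ and $\lambda_2$ are determined by the values of $\mathrm{tr}(A)$ and $\mathrm{tr}(A^2)$. Then we can choose a pair $(D,E)$ lying in the orbit of $(A,B)$ such that

(2.4) $D = D(\lambda_1,\lambda_2) = \begin{pmatrix} \lambda_1 & 0 \\ 0 & \lambda_2 \end{pmatrix}$, $E = E(e_{11},e_{22},e_{12},e_{21}) = \begin{pmatrix} e_{11} & e_{12} \\ e_{21} & e_{22} \end{pmatrix}$,

(2.6) $e_{21} = 1$.

In that case $D$ and $E$ are determined uniquely by
$\mathrm{tr}(D) = \mathrm{tr}(A)$, $\mathrm{tr}(D^2) = \mathrm{tr}(A^2)$, $\mathrm{tr}(E) = \mathrm{tr}(B)$,
$\mathrm{tr}(E^2) = \mathrm{tr}(B^2)$, $\mathrm{tr}(DE) = \mathrm{tr}(AB)$.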
Suppose that $[\mathrm{tr}(A)]^2 = 2\mathrm{tr}(A^2)$. Consider the matrix $\alpha A+\beta B$. Then

(2.7) $2\mathrm{tr}[(\alpha A+\beta B)^2] - [\mathrm{tr}(\alpha A+\beta B)]^2 = \alpha^2\{2\mathrm{tr}(A^2) - [\mathrm{tr}(A)]^2\} + \beta^2\{2\mathrm{tr}(B^2) - [\mathrm{tr}(B)]^2\} + 2\alpha\beta\{2\mathrm{tr}(AB) - \mathrm{tr}(A)\mathrm{tr}(B)\}$.

Thus $\alpha A+\beta B$ has a double eigenvalue for all $\alpha$ and $\beta$ if and only if $(A,B)$ lies on the variety

(2.8) $V = \{(A,B) \mid 2\mathrm{tr}(A^2) = [\mathrm{tr}(A)]^2,\ 2\mathrm{tr}(B^2) = [\mathrm{tr}(B)]^2,\ 2\mathrm{tr}(AB) = \mathrm{tr}(A)\mathrm{tr}(B)\}$.

Since $V \subset U$ we see that $(A,B) \notin V$. So we can choose $\beta \ne 0$ such that $A_1 = A + \beta B$ has distinct eigenvalues. Thus, there exists a matrix $X$ such that
$XA_1X^{-1} = D$, $XBX^{-1} = E$.

Again, if $e_{12}e_{21} = 0$ then $A_1$ and $B$ are simultaneously similar to upper (lower) triangular matrices. So $A$ and $B$ are simultaneously similar to upper (lower) triangular matrices, which contradicts the assumption $(A,B) \notin U$. That is, the orbit of $(A,B)$ contains a pair of the form $D-\beta E$ and $E$ where $D$ is diagonal, $e_{21} = 1$, and $D$ and $E$ are defined uniquely by $\phi_i(A,B)$, $i = 1,\dots,5$, having fixed the value of $\beta$. ■

We now examine the matrix meaning of the variety $U$.

Theorem 2.9. A pair of matrices $(A,B)$ belongs to $U$ if and only if $(A,B)$ is simultaneously similar to a pair of upper triangular matrices.

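Proof. Assume first that $A$ has two distinct eigenvalues $\lambda_1, \lambda_2$. Then $(A,B)$ is simultaneously similar to a pair $(D,E)$ where $D$ and $E$ are given by (2.4). A straightforward calculation yields

(2.10)
$2\mathrm{tr}(A^2) - [\mathrm{tr}(A)]^2 = 2\mathrm{tr}(D^2) - [\mathrm{tr}(D)]^2 = (\lambda_1-\lambda_2)^2$,
$2\mathrm{tr}(B^2) - [\mathrm{tr}(B)]^2 = 2\mathrm{tr}(E^2) - [\mathrm{tr}(E)]^2 = (e_{11}-e_{22})^2 + 4e_{12}e_{21}$,
$2\mathrm{tr}(AB) - \mathrm{tr}(A)\mathrm{tr}(B) = 2\mathrm{tr}(DE) - \mathrm{tr}(D)\mathrm{tr}(E) = (\lambda_1-\lambda_2)(e_{11}-e_{22})$.

The assumption that $(A,B) \in U$ means

$(\lambda_1-\lambda_2)^2[(e_{11}-e_{22})^2+4e_{12}e_{21}] = (\lambda_1-\lambda_2)^2(e_{11}-e_{22})^2$.

Since $\lambda_1 \ne \lambda_2$, $e_{12}e_{21} = 0$. If $e_{21} = 0$ then $D$ and $E$ are upper triangular. Suppose that $e_{12} = 0$, $e_{21} \ne 0$. Define

$P = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$.

Then $(A,B)$ are simultaneously similar to the upper triangular matrices $(PDP^{-1}, PEP^{-1})$. This establishes the theorem in case that $A$ has distinct eigenvalues. Assume that $A$ has a double eigenvalue. If $A = \lambda I$ then clearly $(A,B)$ is simultaneously similar to a pair of upper triangular matrices. So suppose that $(A,B)$ are simultaneously similar to a pair $(D,E)$

(2.11) $D = \begin{pmatrix} \lambda & 1 \\ 0 & \lambda \end{pmatrix}$, $E = \begin{pmatrix} e_{11} & e_{12} \\ e_{21} & e_{22} \end{pmatrix}$.

Since $(A,B) \in U$ we must have

$0 = 2\mathrm{tr}(AB) - \mathrm{tr}(A)\mathrm{tr}(B) = 2\mathrm{tr}(DE) - \mathrm{tr}(D)\mathrm{tr}(E) = 2e_{21}$,
that is $(D,E)$ are upper triangular. ■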
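
Theorem 2.9 can be illustrated by testing the defining equation (2.3) on a few integer pairs. This is a minimal sketch; the function names `disc` and `in_U` and the three sample pairs are assumptions made for the illustration. The second pair is the first one conjugated by a fixed invertible matrix, so it is no longer triangular but should still lie on $U$.

```python
def mul(a, b):
    # Product of two 2x2 integer matrices.
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def tr(a):
    return a[0][0] + a[1][1]

def disc(a):
    # 2 tr(A^2) - [tr(A)]^2, which equals (lambda_1 - lambda_2)^2.
    return 2 * tr(mul(a, a)) - tr(a) ** 2

def in_U(a, b):
    # Membership test for the variety U of (2.3).
    mixed = 2 * tr(mul(a, b)) - tr(a) * tr(b)
    return disc(a) * disc(b) == mixed ** 2

# An upper triangular pair, a conjugate of it, and a generic pair.
upper = ([[1, 2], [0, 3]], [[5, 7], [0, 11]])
conjugated = ([[-3, 4], [-6, 7]], [[-8, 13], [-19, 24]])
generic = ([[1, 2], [3, 4]], [[0, 1], [5, -2]])

print(in_U(*upper), in_U(*conjugated), in_U(*generic))
```

Because (2.3) is written purely in terms of the invariants (2.1), the test gives the same answer on every point of an orbit.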

Corollary 2.12. The variety $U$ is invariant under the linear transformation

(2.13) $A_1 = \alpha A + \beta B$, $B_1 = \gamma A + \delta B$, $\alpha\delta - \beta\gamma \ne 0$.

Let

(2.14)
$U_1 = \{(A,B) \mid (A,B) \in U,\ 2\mathrm{tr}(A^2) \ne [\mathrm{tr}(A)]^2\}$,
$U_2 = \{(A,B) \mid (A,B) \in U,\ 2\mathrm{tr}(B^2) \ne [\mathrm{tr}(B)]^2\}$.

Theorem 2.15. On $U_1$ or $U_2$ the values of $\phi_i(A,B)$, $i = 1,\dots,5$ correspond to 3 distinct orbits. On $U_1 \cap U_2$ the values of $\phi_i(A,B)$, $i = 1,2,3,4$ correspond to 6 distinct orbits, while on $U_1 \cup U_2 - U_1 \cap U_2$ these values correspond to 3 distinct orbits.

Proof. Suppose that $(A,B) \in U_1$. Then $(A,B)$ is simultaneously similar to $(D,E)$ given by (2.4). From the proof of Theorem 2.9 it follows that $e_{12}e_{21} = 0$. So we have the possibilities $e_{12} \ne 0$ and $e_{21} = 0$; $e_{21} \ne 0$ and $e_{12} = 0$; and $e_{12} = e_{21} = 0$. As in the proof of Theorem 2.2, using diagonal similarity we can assume that $E$ has one of the following forms

$E_1 = E(e_{11},e_{22},1,0)$, $E_2 = E(e_{11},e_{22},0,1)$, $E_3 = E(e_{11},e_{22},0,0)$.

We claim that the pairs $(D,E_i)$ and $(D,E_j)$ are not simultaneously similar for $i \ne j$. Indeed, suppose

$D = XDX^{-1}$, $E_j = XE_iX^{-1}$, $i \ne j$.

Since $D$ is diagonal with distinct eigenvalues the first equality implies that $X$ is also diagonal. Then the second equality is impossible. Let $\mu_1$ and $\mu_2$ be the eigenvalues of $B$. Clearly either $\mu_1 = e_{11}$, $\mu_2 = e_{22}$ or $\mu_1 = e_{22}$, $\mu_2 = e_{11}$.

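If $\mu_1 \ne \mu_2$ then a matrix $E_i$ with diagonal $(\mu_1,\mu_2)$ is not of the form $XE_jX^{-1}$, with $E_j$ having the diagonal $(\mu_2,\mu_1)$, for any non-singular diagonal matrix $X$. This shows that for $(A,B) \in U_1 \cap U_2$ the values of the functions $\phi_i(A,B)$, $i = 1,2,3,4$ correspond to 6 distinct orbits. If $B$ has a multiple eigenvalue then the two orderings of the diagonal coincide and the above values correspond to 3 distinct orbits. Thus we proved that on $U_1 \cup U_2 - U_1 \cap U_2$ the values of $\phi_i(A,B)$, $1 \le i \le 4$, determine 3 distinct orbits. Suppose we are also given $\phi_5(A,B) = \mathrm{tr}(AB)$. Note that

$(\lambda_1\mu_1+\lambda_2\mu_2) - (\lambda_1\mu_2+\lambda_2\mu_1) = (\lambda_1-\lambda_2)(\mu_1-\mu_2)$.

Thus for $(A,B) \in U_1 \cap U_2$

$\mathrm{tr}(D\,E(\mu_1,\mu_2,e_{12},e_{21})) \ne \mathrm{tr}(D\,E(\mu_2,\mu_1,e_{12},e_{21}))$.

Therefore on $U_1 \cup U_2$ the values of $\phi_i(A,B)$, $1 \le i \le 5$ correspond to 3 distinct orbits. ■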
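
The statement that the five invariants cannot separate the three normal forms can be seen on a concrete instance. This is a minimal sketch: the constructor `E` follows the argument order of (2.4), and the numerical values chosen for $\lambda_1,\lambda_2,e_{11},e_{22}$ are arbitrary assumptions.

```python
def mul(a, b):
    # Product of two 2x2 integer matrices.
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def tr(a):
    return a[0][0] + a[1][1]

def invariants(a, b):
    # (phi_1, ..., phi_5) of (2.1).
    return (tr(a), tr(mul(a, a)), tr(b), tr(mul(b, b)), tr(mul(a, b)))

def E(e11, e22, e12, e21):
    # Same argument order as E(e11, e22, e12, e21) in (2.4).
    return [[e11, e12], [e21, e22]]

D = [[2, 0], [0, 5]]                       # lambda_1 = 2, lambda_2 = 5
forms = [E(3, 7, 1, 0), E(3, 7, 0, 1), E(3, 7, 0, 0)]   # E_1, E_2, E_3

values = [invariants(D, e) for e in forms]
commutes = [mul(D, e) == mul(e, D) for e in forms]
print(values)
print(commutes)
```

All three pairs share the same invariants, yet only $(D,E_3)$ is a commuting pair, so it cannot be similar to the other two; separating $(D,E_1)$ from $(D,E_2)$ needs the diagonal-similarity argument given in the proof.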

Let $(A,B) \in U$. Assume that $(A,B)$ is simultaneously similar to diagonal matrices $(D,E_3)$. Then $A$ and $B$ commute. That is, $(A,B)$ lies on the manifold

(2.16) $C = \{(A,B) \mid AB-BA = 0\}$.

Clearly $U \cap C$ is a subvariety of $U$. That is, a generic orbit in $U$ contains a pair of the form $(D,E_1)$ or $(D,E_2)$. The fact that the orbit corresponding to $(D,E_3)$ is not generic can be seen in the following way. Consider the orbit $\mathrm{orb}(A,B)$ as a manifold. Denote by $\dim \mathrm{orb}(A,B)$ the dimension of this manifold. Then

(2.17)
$\dim \mathrm{orb}(D,E_i) = 3$, $i = 1,2$, $D \in U_1$,
$\dim \mathrm{orb}(D,E_3) = 2$, $D \in U_1$.

Indeed, consider all matrices $X$ which commute with $D$ and $E_i$:

$DX - XD = 0$, $E_iX - XE_i = 0$, $D \in U_1$.

For $i=1,2$, $X = \lambda I$, and for $i=3$, $X$ is any diagonal matrix. This establishes (2.17). We shall see later in this section that $\phi_i(A,B)$, $1 \le i \le 5$ are generators of $[U]^G$.
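
The dimension count behind (2.17) can be reproduced mechanically. The sketch below assumes the standard fact that the orbit dimension equals 4 minus the dimension of the space of matrices $X$ commuting with both members of the pair, which is the rank of the linear map $X \mapsto (AX-XA,\ BX-XB)$. The function names and the sample values are illustrative assumptions.

```python
from fractions import Fraction

def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def rank(rows):
    # Rank by Gauss-Jordan elimination over the rationals.
    m = [[Fraction(x) for x in r] for r in rows]
    rk = 0
    for c in range(len(m[0])):
        pivot = next((i for i in range(rk, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for i in range(len(m)):
            if i != rk and m[i][c] != 0:
                f = m[i][c] / m[rk][c]
                m[i] = [x - f * y for x, y in zip(m[i], m[rk])]
        rk += 1
    return rk

def orbit_dim(a, b):
    # Column k holds the entries of (A X - X A, B X - X B) for the k-th
    # matrix unit X; the rank of this 8x4 matrix is 4 - dim(commutant).
    cols = []
    for k in range(4):
        x = [[0, 0], [0, 0]]
        x[k // 2][k % 2] = 1
        col = []
        for m_ in (a, b):
            left, right = mul(m_, x), mul(x, m_)
            col += [left[i][j] - right[i][j] for i in range(2) for j in range(2)]
        cols.append(col)
    return rank([list(r) for r in zip(*cols)])

D = [[2, 0], [0, 5]]
E1, E2, E3 = [[3, 1], [0, 7]], [[3, 0], [1, 7]], [[3, 0], [0, 7]]
dims = [orbit_dim(D, e) for e in (E1, E2, E3)]
print(dims)
```

For the scalar pair the same routine returns 0, in agreement with the orbit being a single point.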

From the proof of Theorem 2.15 it follows that the values of these functions do not separate between the two generic orbits corresponding to the pairs $(D,E_1)$ and $(D,E_2)$, $D \in U_1$. According to the result in Friedland [1981] there exist rational functions $\theta_i \in (U)^G$, $i = 1,\dots,r$, which separate the orbits in $U-W$, for some algebraic subvariety $W$ in $U$. We now give such functions $\theta_i$. Let $(A,B) \in U$. Then Theorem 2.9 claims that $A$ and $B$ are simultaneously similar to a pair of upper triangular matrices. Suppose that $(A,B) \in U_1 \cap U_2$ and $\dim \mathrm{orb}(A,B) = 3$. Then $A = (a_{ij})_1^2$ and $B = (b_{ij})_1^2$ have exactly one common eigenvector $(x_1,x_2)^t$, corresponding to the eigenvalues $\lambda$ and $\mu$ respectively:

$(a_{11}-\lambda)x_1 + a_{12}x_2 = 0$, $a_{21}x_1 + (a_{22}-\lambda)x_2 = 0$,

$(b_{11}-\mu)x_1 + b_{12}x_2 = 0$, $b_{21}x_1 + (b_{22}-\mu)x_2 = 0$.

Assume for simplicity that

$a_{12}a_{21}b_{12}b_{21}x_1x_2 \ne 0$.

Then we have the equalities

$(a_{11}-\lambda)b_{12} = (b_{11}-\mu)a_{12}$, $(a_{22}-\lambda)b_{21} = (b_{22}-\mu)a_{21}$.

So

$\mu = \dfrac{b_{12}}{a_{12}}\lambda + \dfrac{b_{11}a_{12}-a_{11}b_{12}}{a_{12}} = \dfrac{b_{21}}{a_{21}}\lambda + \dfrac{b_{22}a_{21}-a_{22}b_{21}}{a_{21}}$.

From the last two equalities we can compute the value of $\lambda$ and then the value of $\mu$:

(2.18)
$\lambda = \alpha(A,B) = \dfrac{(b_{11}-b_{22})a_{12}a_{21} + a_{22}a_{12}b_{21} - a_{11}a_{21}b_{12}}{a_{12}b_{21} - a_{21}b_{12}}$,

$\mu = \beta(A,B) = \dfrac{(a_{11}-a_{22})b_{12}b_{21} + b_{22}b_{12}a_{21} - b_{11}b_{21}a_{12}}{b_{12}a_{21} - b_{21}a_{12}}$.
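
As a sanity check on the elimination above, the two rational functions can be evaluated on a pair obtained by conjugating an upper triangular pair whose common eigenvector belongs to known eigenvalues. This is a sketch under stated assumptions: `alpha` and `beta` implement the quotients obtained by solving the two linear relations for $\lambda$ and $\mu$, and the matrices `A0`, `B0`, `T` are arbitrary illustrative choices.

```python
from fractions import Fraction

def mat(rows):
    return [[Fraction(x) for x in row] for row in rows]

def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def inv(t):
    det = t[0][0] * t[1][1] - t[0][1] * t[1][0]
    return [[t[1][1] / det, -t[0][1] / det],
            [-t[1][0] / det, t[0][0] / det]]

def alpha(a, b):
    # Eigenvalue of A on the common eigenvector, obtained by eliminating mu
    # from the two linear relations between lambda and mu.
    num = ((b[0][0] - b[1][1]) * a[0][1] * a[1][0]
           + a[1][1] * a[0][1] * b[1][0]
           - a[0][0] * a[1][0] * b[0][1])
    den = a[0][1] * b[1][0] - a[1][0] * b[0][1]
    return num / den

def beta(a, b):
    # Same expression with the roles of A and B exchanged.
    return alpha(b, a)

# Upper triangular pair: common eigenvector (1, 0)^t, eigenvalues 1 and 5.
A0 = mat([[1, 2], [0, 3]])
B0 = mat([[5, 7], [0, 11]])
T = mat([[1, 1], [1, 2]])
A = mul(mul(T, A0), inv(T))
B = mul(mul(T, B0), inv(T))

print(alpha(A, B), beta(A, B))
```

The conjugation is needed because the formulas require $a_{12}a_{21}b_{12}b_{21} \ne 0$, which fails for the triangular pair itself.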


Theorem 2.19. The functions $\alpha(A,B)$, $\beta(A,B)$ belong to $(U)^G$. Moreover these functions are defined on $\mathrm{orb}(A,B)$ such that $(A,B) \in U-C$ and the values of $\alpha(A,B)$, $\beta(A,B)$, $\mathrm{tr}(A)$, $\mathrm{tr}(B)$ determine these orbits uniquely.
Proof. Clearly, $\alpha(A,B)$, $\beta(A,B) \in (U)$ since $a_{12}b_{21} - a_{21}b_{12}$ is not vanishing identically on $U$. Put

$\lambda_2 = \mathrm{tr}(A) - \alpha(A,B)$, $\mu_2 = \mathrm{tr}(B) - \beta(A,B)$.

Then a straightforward calculation yields

$\alpha^2 + \lambda_2^2 = \mathrm{tr}(A^2)$, $\beta^2 + \mu_2^2 = \mathrm{tr}(B^2)$,

as $(A,B) \in U$. That is, $\alpha(A,B)$ and $\beta(A,B)$ are the eigenvalues of $A$ and $B$ respectively. This shows that $\alpha, \beta \in (U)^G$. Assume that $A \in U_1$ and $AB-BA \ne 0$. Then the orbit of $(A,B)$ contains a pair $(D,E_1)$. Clearly $\alpha(A,B)$ and $\beta(A,B)$ are not defined for $(D,E_1)$. Let $(P,Q) \in \mathrm{orb}(A,B)$ lie close to $(D,E_1)$. That is

P  =  (I+X)~^D(I+X),  Q  =  (I+X)~’e, (I+X), 
p  -  8  -  . 


Lp  ■  ”2.  -  'VS  >■‘21  * 


q^2  =  I  +  0(»Xll),  q^,  =  ^ 


Pl2‘^2rP21^12  =  <S-^2><V‘‘l^^2’'21  '  ^VS’^21  ^ 

Whence it is possible to find $(P,Q) \in \mathrm{orb}(A,B)$ such that the above expression is different from zero. Put

$\alpha(A,B) = \alpha(P,Q)$, $\beta(A,B) = \beta(P,Q)$

and the functions $\alpha$ and $\beta$ are well defined. The matrices $(D,E_1)$ have exactly one common eigenvector, which corresponds to $\lambda_1$ and $e_{11}$. So

$\lambda_1 = \alpha(A,B)$, $e_{11} = \beta(A,B)$,

$\lambda_2 = \mathrm{tr}(A) - \alpha(A,B)$, $e_{22} = \mathrm{tr}(B) - \beta(A,B)$

and the matrices $D$ and $E_1$ are determined. The same arguments apply if $B \in U_2$, $AB-BA \ne 0$. Suppose that $A \notin U_1$ and $B \notin U_2$. That is, $(A,B)$ is simultaneously similar to $(E(\lambda,\xi), E(\mu,\eta))$ where

(2.20) $E(\lambda,\xi) = \begin{pmatrix} \lambda & \xi \\ 0 & \lambda \end{pmatrix}$.

But then $E(\lambda,\xi)$ and $E(\mu,\eta)$ commute. The proof of the theorem is completed. ■

For any 2×2 matrices $A,B$ the (1,1) entry of $AB-BA$ equals $a_{12}b_{21} - a_{21}b_{12}$. That is, $a_{12}b_{21} - a_{21}b_{12}$ vanishes identically on $C$ and therefore the functions $\alpha(A,B)$ and $\beta(A,B)$ are not defined on $C$.

If $A \in U_1$ and $A$ and $B$ commute, then $A$ and $B$ are simultaneously diagonalizable. Then from the proof of Theorem 2.15 it follows

Theorem 2.21. On $(U_1 \cup U_2) \cap C$ the values of the functions $\phi_i(A,B)$, $i = 1,2,3,4,5$ determine the orbit of $(A,B)$.

We are left with the orbits in $U \cap C$ such that $A$ and $B$ have double eigenvalues. It means that $(A,B)$ are simultaneously similar to $(E(\lambda,\xi), E(\mu,\eta))$. As $E(\lambda,\xi)$ and $E(\mu,\eta)$ commute we get

(2.22) $V \subset C$.

Thus we need to classify the orbits in $V$.

Theorem 2.23. Let $(A,B) \in V$. Then

(2.24) $\gamma(A,B)$

belongs to $(V)^G$. Suppose that either $\gamma(A,B)$ or $1/\gamma(A,B)$ is defined on the orbit of $(A,B)$. Then the values of $\mathrm{tr}(A)$, $\mathrm{tr}(B)$ and $\gamma(A,B)$ ($1/\gamma(A,B)$) determine a unique orbit in $V$.

Proof. According to what we proved, $\mathrm{orb}(A,B)$ contains a pair $(E(\lambda,\xi), E(\mu,\eta))$. Clearly

$\lambda = \tfrac{1}{2}\mathrm{tr}(A)$, $\mu = \tfrac{1}{2}\mathrm{tr}(B)$.

By considering the matrices

$A_1 = A - \lambda I$, $B_1 = B - \mu I$
we can assume that $\lambda=0$ and $\mu=0$. Let

$A = XE(0,\xi)X^{-1}$, $B = XE(0,\eta)X^{-1}$.

Then a straightforward calculation shows that $\gamma(A,B) = \xi/\eta$.

That is, $\gamma(A,B)$ is indeed an invariant function on $V$. Assume that $\eta \ne 0$. In that case $\gamma(A,B)$ is well defined. Choosing an appropriate diagonal matrix $X$ and considering the matrices $XE(\lambda,\xi)X^{-1}$, $XE(\mu,\eta)X^{-1}$ we can assume that $\eta=1$. Then $\xi = \gamma(A,B)$. Hence $\mathrm{tr}(A)$, $\mathrm{tr}(B)$ and $\gamma(A,B)$ determine the orbit of $(A,B)$. The same arguments apply if $\xi \ne 0$, i.e. $1/\gamma(A,B)$ is well defined. ■

Suppose that neither $\gamma(A,B)$ nor $1/\gamma(A,B)$ is defined on the orbit of $(A,B)$. Then we must have that $(A,B)$ lies on $W$,

(2.25) $W = \{(A,B) \mid a_{12} = a_{21} = b_{21} = b_{12} = 0,\ a_{11} = a_{22},\ b_{11} = b_{22}\}$.

Clearly, on $W$, $\mathrm{tr}(A)$ and $\mathrm{tr}(B)$ determine the orbit completely.

Thus we completed the classification of the simultaneous similarity of pairs of 2×2 matrices according to the program outlined in Friedland [1981]. Next we note that on all subvarieties in $M_2\times M_2 - V$ the values of the functions $\phi_i(A,B)$, $1 \le i \le 5$ determine a finite number of orbits. We claim that on $V$ the values of any set of invariant polynomial functions, i.e. functions belonging to $[V]^G$, cannot in general determine a finite number of orbits in $V$.

Theorem 2.26. The functions $\mathrm{tr}(A)$ and $\mathrm{tr}(B)$ are the generators in $[V]^G$.
Proof. Let $(A,B) \in V$. Then $\mathrm{orb}(A,B)$ contains a pair $E(\lambda,\xi)$, $E(\mu,\eta)$. Then

$E^{-1}(1,d,0,0)\,E(\lambda,\xi)\,E(1,d,0,0) = E(\lambda,\xi d)$,

$E^{-1}(1,d,0,0)\,E(\mu,\eta)\,E(1,d,0,0) = E(\mu,\eta d)$.

Letting $d \to 0$ we get that the closure of $\mathrm{orb}(A,B)$ contains the matrices $E(\lambda,0)$, $E(\mu,0)$. Suppose that $\phi(A,B) \in [V]^G$. Then

$\phi(A,B) = \phi(E(\lambda,0),E(\mu,0)) = g(\lambda,\mu) = h(\mathrm{tr}(A),\mathrm{tr}(B))$,
for some $h \in C[x,y]$. ■
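
The scaling step in this proof is easy to reproduce. The sketch below conjugates $E(\lambda,\xi)$ by $\mathrm{diag}(1,d)$ for a shrinking sequence of $d$ and records the off-diagonal entry. The helper names `E` and `scale`, and the values $\lambda = 3$, $\xi = 4$, are assumptions chosen for illustration.

```python
from fractions import Fraction

def mul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def E(lam, xi):
    # The matrix E(lambda, xi) of (2.20).
    return [[Fraction(lam), Fraction(xi)], [Fraction(0), Fraction(lam)]]

def scale(d, m):
    # diag(1, d)^{-1} * m * diag(1, d), with d a nonzero Fraction.
    s = [[Fraction(1), Fraction(0)], [Fraction(0), d]]
    s_inv = [[Fraction(1), Fraction(0)], [Fraction(0), 1 / d]]
    return mul(mul(s_inv, m), s)

lam, xi = 3, 4
ds = [Fraction(1, 10 ** k) for k in range(4)]
path = [scale(d, E(lam, xi)) for d in ds]
offdiag = [m[0][1] for m in path]
print(offdiag)
```

Every matrix on the path stays in the orbit, while the off-diagonal entry tends to zero, so the limit $E(\lambda,0)$ lies in the closure of the orbit. Any polynomial invariant, being continuous, must take the same value at the limit.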

Thus the transcendence degree of $[V]^G$ over $C$ is 2. The transcendence degree of $(V)^G$ over $C$ is 3. More precisely we have
Theorem 2.27. The functions $\mathrm{tr}(A)$, $\mathrm{tr}(B)$ and $\gamma(A,B)$ generate $(V)^G$ and they are algebraically independent.

Proof. We claim that

(2.28) $\dim \mathrm{orb}(A,B) \le 2$, $(A,B) \in V$.

Indeed, suppose that $(A,B)$ is simultaneously similar to $E(\lambda,\xi)$, $E(\mu,\eta)$. Then $E(\alpha,\beta)$ always commutes with $E(\lambda,\xi)$ and $E(\mu,\eta)$. Moreover if either $\xi$ or $\eta \ne 0$ then the matrices $E(\alpha,\beta)$ are the only matrices which commute with $E(\lambda,\xi)$ and $E(\mu,\eta)$. This proves (2.28). As the dimension of $V$ is 5 the transcendence degree of $(V)^G$ is at most 3. Clearly $\mathrm{tr}(A)$, $\mathrm{tr}(B)$ and $\gamma(A,B)$ can be given any values $z_1$, $z_2$ and $z_3$. So these functions are algebraically independent, i.e. the transcendence degree of $(V)^G$ is 3. Let $\phi(A,B) \in (V)^G$. Then $\phi(A,B)$ is algebraic with respect to $x_1 = \mathrm{tr}(A)$, $x_2 = \mathrm{tr}(B)$, $x_3 = \gamma(A,B)$. Let

(2.29) $\phi^m + \sum_{i=1}^{m} p_i(x_1,x_2,x_3)\,\phi^{m-i} = 0$, $p_i \in C(x_1,x_2,x_3)$

be the minimal equation for $\phi$. So the left hand side of (2.29) is an irreducible polynomial over $C(x_1,x_2,x_3)$. But then given the values of $x_1,x_2,x_3$ we know that $\mathrm{orb}(A,B)$ is determined ($x_3$ is well defined).
Thus, if $\phi$ is defined on $\mathrm{orb}(A,B)$, for example $p_i(x_1,x_2,x_3) \ne 0$, then $\phi$ has a unique value. But it is well known that (2.29) must be an $m$-sheeted cover of $C$. Whence $m=1$ and $\phi$ is rational in $x_i$, $i=1,2,3$. ■

Next we use the arguments of Theorem 2.27 to show that $(M_2\times M_2)^G$ is generated by $\phi_i(A,B)$, $i=1,2,3,4,5$.

Theorem 2.30. The functions $\phi_i(A,B)$, $i=1,2,3,4,5$ generate $(M_2\times M_2)^G$ and are algebraically independent. Moreover these functions generate $[M_2\times M_2]^G$.
Proof. Since the dimension of the generic orbit in $M_2\times M_2$ is 3, the transcendence degree of $(M_2\times M_2)^G$ is 5. As a generic orbit is determined by the values of $\phi_i(A,B)$, $1 \le i \le 5$ we easily deduce that $\phi_i(A,B)$, $i=1,\dots,5$ are algebraically independent. Therefore these functions form a transcendental basis in $(M_2\times M_2)^G$. Let $\phi \in (M_2\times M_2)^G$. So $\phi$ is algebraic over $\phi_1,\dots,\phi_5$:

(2.31) $\phi^m + \sum_{i=1}^{m} p_i(\phi_1,\dots,\phi_5)\,\phi^{m-i} = 0$.

Let $(A,B) \notin U$. Then $\mathrm{orb}(A,B)$ is determined uniquely by $\phi_1,\dots,\phi_5$. Combine this with (2.31) to deduce as in the proof of the previous theorem that $m=1$, i.e. $\phi$ is rational in $\phi_1,\dots,\phi_5$. This shows that $\phi_1,\dots,\phi_5$ generate $(M_2\times M_2)^G$. Let $\phi \in [M_2\times M_2]^G$. So $\phi$ is rational in $\phi_1,\dots,\phi_5$. Also $\phi$ has a finite value for any $\phi_1,\dots,\phi_5$. Thus $\phi$ must be a polynomial in $\phi_1,\dots,\phi_5$. That is, $[M_2\times M_2]^G$ is generated by $\phi_1,\dots,\phi_5$. ■

We finally compare the results of this section to the classification procedure outlined in Section 1. We first note that if $A$ and $B$ are not commuting then the only non-trivial solution $X$ to the system

(2.32) $AX-XA = 0$, $BX-XB = 0$

is $X = \lambda I$. That is

(2.33) $M_2\times M_2 - C = M_{2,2,3}$.
We conjecture

(2.34) $M_2\times M_2 - U = M_{2,2,3,\rho_1}$, $U - C = M_{2,2,3,\rho_2}$, $\rho_1 > \rho_2$.

It seems that for $(A,B) \in M_2\times M_2 - U$ all 4×4 minors of $L(X,Y,A,B)$ are linearly independent. That is equivalent to the equality

$\rho_1 = \binom{8}{4} = 70$.


On $C - W$ the system (2.32) has two independent solutions $X=A$, $X=B$ and $X=I$ if $A$ or $B$ are zero matrices. So

(2.35) $C - W = M_{2,2,2}$.

We conjecture

(2.36) $C - V = M_{2,2,2,\rho_3}$, $V - W = M_{2,2,2,\rho_4}$, $\rho_3 > \rho_4$.

Finally

(2.37) $W = M_{2,2,0}$.

We now verify Conjecture 1.26 for m=n=2.

Theorem 2.38. Let $A, B \in M_2$ and assume $r = 4-\nu(A,B)$ where $\nu(A,B)$ is defined as in Section 1. Let $X$ be the algebraic variety given by (1.9) (m=n=2). Then

(2.39)
$X = \mathrm{orb}(A,B)$ for $(A,B) \in M_2\times M_2 - U$,
$X = X_1 \cup X_2$, $X_1 = \overline{\mathrm{orb}}(A,B)$, $\dim X_2 = 5$, for $(A,B) \in U - C$,
$X = X_1 \cup X_2 \cup X_3$, $X_1 = \mathrm{orb}(A,B)$, $X_{i+1} = \{(\lambda_i I, \mu_i I)\}$, $i = 1,2$, for $(A,B) \in C - V$,
$X = \overline{\mathrm{orb}}(A,B)$ for $(A,B) \in V - W$,
$X = \mathrm{orb}(A,B) = \{(A,B)\}$ for $(A,B) \in W$.

In particular Conjecture 1.26 is valid.

Proof. Assume first that $(A,B) \in M_2\times M_2 - U$. It is enough to consider the case where $A$ has two distinct eigenvalues. Thus we may assume that $A=D$, $B=E$ where $D$ and $E$ are of the form (2.4) and $e_{12}e_{21} \ne 0$. In particular $\mathrm{rank}\,L(A,B,A,B) = 3$. Suppose that

(2.40) $PX-XD = 0$, $QX-XE = 0$, $P = (p_{ij})$, $Q = (q_{ij})$

has a non-trivial solution $X$ such that $|X| = 0$. By considering the matrices

$P_1 = TPT^{-1}$, $Q_1 = TQT^{-1}$, $X_1 = TX$

we may assume that either

(2.41) $X = \begin{pmatrix} 1 & x \\ 0 & 0 \end{pmatrix}$

or

(2.42) $X = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}$.

Suppose that possibility (2.41) holds. Then the first equation of (2.40) yields

(2.43) $p_{11} = \lambda_1$, $p_{21} = 0$, $x = 0$.

The second equation of (2.40) implies that $e_{12} = 0$, which is impossible. So, if $X \ne 0$ satisfies (2.40) then $|X| \ne 0$. Hence $(P,Q)$ is simultaneously similar to $(D,E)$. That is, $X = \mathrm{orb}(A,B)$ and this orbit is closed.

Suppose next that (2.42) holds. Then the first equation of (2.40) implies

(2.44) $p_{11} = \lambda_2$, $p_{21} = 0$.

The second equation of (2.40) yields $e_{21} = 0$, which is impossible. This proves the theorem for $(A,B) \in M_2\times M_2 - U$. Assume now that $(A,B) \in U - C$.

Again we may assume that $A$ has two distinct eigenvalues. So let $A=D$, $B=E$, $e_{12} \ne 0$, $e_{21} = 0$.

Suppose that $X$ is of the form (2.41). Then (2.43) holds. Then the second equality of (2.40) yields that $e_{12} = 0$, which is impossible. Assume now that $X$ is of the form (2.42). So the equality (2.44) holds. Then the second equality of (2.40) implies

(2.45) $q_{11} = e_{22}$, $q_{21} = 0$.
We claim $(P,Q)$ is not similar to $(D,E)$. Indeed, if $P,Q$ do not commute then $P$ and $Q$ have one common eigenvector corresponding to the eigenvalues $\lambda_2$ and $e_{22}$ respectively. Since the common eigenvector of $D$ and $E$ corresponds to $\lambda_1$ and $e_{11}$ respectively, $(P,Q)$ and $(D,E)$ are not similar. Clearly, if $P$ and $Q$ commute then $(P,Q)$ and $(D,E)$ are not similar. The set of all matrices $(P,Q)$ satisfying (2.44) and (2.45) forms a manifold of dimension 4. Consider all pairs $(P_1,Q_1)$ simultaneously similar to $(P,Q)$. As the set of matrices $(P,Q)$ satisfying (2.44) and (2.45) is invariant under the transformation $(TPT^{-1}, TQT^{-1})$, where $T$ is an upper triangular matrix, we deduce that the manifold consisting of matrices $(P_1,Q_1)$ is of dimension 5. Thus $X_2$, the closure of this manifold, is an irreducible variety of dimension 5. This establishes the theorem in this case. Note that in this case

(2.46) $X_1 \cap X_2 = \mathrm{orb}(D,E_3)$, $D = \begin{pmatrix} \lambda_1 & 0 \\ 0 & \lambda_2 \end{pmatrix}$, $E_3 = \begin{pmatrix} e_{11} & 0 \\ 0 & e_{22} \end{pmatrix}$.

Assume next that $(A,B) \in C - V$. So $A$ and $B$ are simultaneously similar to diagonal matrices such that at least one of them has distinct eigenvalues. Again we may assume $A=D$ has two distinct eigenvalues and $B=E$ with $e_{12} = e_{21} = 0$.

Let $X$ satisfy (2.40). Then as before either (2.41) and (2.43) hold or (2.42) and (2.44) hold. As $\nu(A,B) = 2$ we have an additional linearly independent matrix $Y$ satisfying

(2.40)' $PY-YD = 0$, $QY-YE = 0$, $Y = (y_{ij})$.

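We also assume that $|aX+bY| = 0$ for all $a$ and $b$. By interchanging the roles of $X$ and $Y$ we get 3 possibilities

(2.47)
$X_1 = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix}$, $Y_1 = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}$,
$X_2 = X_1$, $Y_2 = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}$,
$X_3 = Y_1$, $Y_3 = \begin{pmatrix} 0 & 0 \\ 0 & 1 \end{pmatrix}$.

The choice $X_1$ and $Y_1$ is impossible since we get $\lambda_1 = \lambda_2$. The choice $X_{i+1}, Y_{i+1}$ yields the solutions $P = \lambda_i I$ and $Q = \mu_i I$, $i=1,2$. So the theorem is established in this case.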
Next let $(A,B) \in V-W$. By considering the matrices

$A_1 = \alpha A+\beta B$, $B_1 = \gamma A+\delta B$, $\alpha\delta-\beta\gamma \ne 0$

and making the similarity transformation $(TAT^{-1}, TBT^{-1})$ we may assume that

(2.48) $A = \begin{pmatrix} \lambda & 1 \\ 0 & \lambda \end{pmatrix}$, $B = \begin{pmatrix} \mu & 0 \\ 0 & \mu \end{pmatrix}$.

Suppose that $X$ is of the form (2.41). It is easy to check that it is impossible to satisfy the first equation of (2.40). So $X$ must be of the form (2.42). Then the linearly independent matrix $Y$ which satisfies (2.40)' can be chosen of the form $Y_3$ given by (2.47).

So the equalities (2.40)-(2.40)' yield $P = \lambda I$, $Q=B$. Since the matrices

$\left(A(a) = \begin{pmatrix} \lambda & a \\ 0 & \lambda \end{pmatrix}, B\right) \in \mathrm{orb}(A,B)$

for any $a \ne 0$ we deduce that $(P,Q) \in \overline{\mathrm{orb}}(A,B)$. So $\overline{\mathrm{orb}}(A,B)$ is the only irreducible component of the system (1.9).

Assume finally that $(A,B) \in W$, i.e. $A = \lambda I$, $B = \mu I$. Then $\nu(\lambda I,\mu I) = 4$ so $r(\lambda I,\mu I) = 0$. But then $L(X_0,X_1,\lambda I,\mu I) = 0$ if and only if $X_0 = \lambda I$ and $X_1 = \mu I$. Thus we proved the equality (2.39). Suppose $(A,B), (A_1,B_1) \in M_{2,2,r,\rho}$ and assume that the equality (1.24) holds. Then the variety $X$ given by (1.9) contains the irreducible varieties $\mathrm{orb}(A,B)$ and $\mathrm{orb}(A_1,B_1)$ of dimension $r$. The equality (2.39) yields that $X$ contains exactly one irreducible variety of dimension $r$. So $\mathrm{orb}(A,B) = \mathrm{orb}(A_1,B_1)$. Now Theorem 1.14 (the equality (1.16)) yields that $(A,B)$ and $(A_1,B_1)$ are simultaneously similar. ■

Remark 2.49. The proof of Theorem 2.38 yields the existence of non-similar pairs of matrices $(A_1,B_1)$, $(A_2,B_2)$ such that

(2.50) $\nu(A_1,B_1) = \nu(A_2,B_2) = \nu(A_1,B_1,A_2,B_2) = 1$, $(A_1,B_1), (A_2,B_2) \in U - C$.

This cannot happen in case that m=0 since the equality sign in (1.27) implies the similarity of $A_0$ and $B_0$ for any dimension $n$. (See Friedland [1980].)

3. Polynomial maps and varieties.


Let

$\theta : C^{\nu} \to C^{\mu}$, $\theta = (\theta_1(\xi),\dots,\theta_\mu(\xi))$, $\xi \in C^{\nu}$

be a polynomial map, i.e. $\theta_i(\xi)$ is a polynomial for $i = 1,\dots,\mu$. In what follows we shall survey various properties of these maps needed in this paper. See van der Waerden [1950], Shafarevich [1974] and Whitney [1972] for general references on the algebraic and analytic properties of the polynomial maps. Specific results will be given the exact reference. The inverse image of $\omega$, i.e. $\theta^{-1}(\omega)$, is called an algebraic variety in $C^{\nu}$. That is, any algebraic variety in $C^{\nu}$ is given by some system of polynomial equations $\theta_i(\xi) = \omega_i$, $i = 1,\dots,\mu$, for some $\mu$. We denote this algebraic variety by $X$. In what follows all the varieties mentioned here are algebraic. $X$ is called reducible if $X = X_1 \cup X_2$ where $X_i \ne X$ for $i=1,2$ and each $X_i$ is a variety. Otherwise $X$ is called irreducible. It is well known that any variety is a finite union of irreducible ones. Any irreducible variety is connected. A point $x$ in an irreducible variety $X$ is called regular if in the neighborhood of this point $X$ is a manifold. The dimension of this manifold does not depend on the choice of the regular point and it is called the dimension of $X$. Let $X^0$ be the set of all regular points in $X$. Then $X^0$ is an open connected set in $X$. In particular $X^0$ is a manifold. Recall that $Y$ is an open set in $X$ if $Y = X \cap W$ for some open set $W$ in $C^{\nu}$. $Y$ is called algebraically open if $W$ is the set of points for which $p(\xi) \ne 0$, for some polynomial $p$. Note that a finite intersection of open algebraic sets $\bigcap_{\alpha=1}^{m} W_\alpha$ (defined by $p_\alpha \ne 0$) is an open algebraic set $W$ given by $p = p_1\cdots p_m$.

If $Y$ is an open algebraic set in an irreducible variety $X$ then $Y$ is connected. Let $X$ be an irreducible variety and $X^0$ be the manifold of the regular points of $X$.
Then $X^0$ is an open algebraic set. That is, the set of singular points in $X$ is a subvariety of $X$. For a reducible variety $X$ the dimension of $X$ is defined as the largest dimension of its irreducible components. A variety $X$ is called homogeneous if $tX = X$ for all $t \ne 0$. In that case $X$ is the zero set of $\theta_i(\xi) = 0$, $i=1,\dots,\mu$, where each $\theta_i(\xi)$ is a homogeneous polynomial. For any variety $X$, let $X^h$ be the homogeneous variety given as the closure of the union of $tX$, for all possible $t$. Clearly $X$ is irreducible if and only if $X^h$ is irreducible. To study the homogeneous varieties it is convenient to introduce the projective space $P^{\nu-1}$. It is the set $C^{\nu} - \{0\}$ when one identifies $\xi$ with $t\xi$ for any $t \ne 0$. Thus any homogeneous variety $X$ of dimension $d$ gives rise to the projective variety $\hat{X}$ of dimension $d-1$ in $P^{\nu-1}$ and vice versa. A well known result for projective varieties claims (e.g. Shafarevich [1974]).

Theorem 3.1. Let $\hat{X}_1$ and $\hat{X}_2$ be projective varieties of dimensions $d_1-1$ and $d_2-1$ respectively in $P^{\nu-1}$. If $d_1+d_2 \ge \nu+1$ then $\hat{X}_1 \cap \hat{X}_2$ is a non-empty projective variety at least of dimension $d_1+d_2-\nu-1$.

Let $\hat{X}$ be a projective variety of dimension $d-1$. Let $\hat{Y}$ be a projective variety given by the intersection of $d-1$ hyperplanes

(3.2) $H_i = \{x \mid \sum_{j=1}^{\nu} a_{ij}x_j = 0,\ x \in P^{\nu-1}\}$, $i = 1,\dots,d-1$.

Then, according to Theorem 3.1, $\hat{X} \cap \hat{Y}$ is a non-empty projective variety. We assume here that $(a_{i1},\dots,a_{i\nu}) \in P^{\nu-1}$, $i = 1,\dots,d-1$. Moreover, except for some variety $Z$ in

$P^{\nu-1,d-1} = P^{\nu-1}\times\cdots\times P^{\nu-1}$,

$\hat{X} \cap \hat{Y}$ consists of exactly $\delta$ points in $P^{\nu-1}$. The number $\delta$ is called the degree of the variety $\hat{X}$ and is denoted by $\deg \hat{X}$. In what follows we need the following result.
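Theorem 3.3. Let $\hat{X}$ be an irreducible projective variety in $P^{\nu-1}$ of dimension $d-1 \ge 1$ and degree $\delta$. Let $H_1,\dots,H_{d-1}$ be any $d-1$ hyperplanes in $P^{\nu-1}$. Let

(3.4) $\hat{X}_k = \hat{X} \cap \bigcap_{j=1}^{k} H_j = \bigcup_{i=1}^{\nu_k} \hat{X}_{ki}$, $k = 1,\dots,d-1$,

where $\hat{X}_{k1},\dots,\hat{X}_{k\nu_k}$ are the irreducible components of $\hat{X}_k$. Then

(3.5) $\dim(\hat{X}_{ki}) \ge d-1-k$,

(3.6) $\sum_{i=1}^{\nu_k} \deg \hat{X}_{ki} \le \deg \hat{X} = \delta$.

Proof. Consider

$\hat{X}_1 = \hat{X} \cap H_1$.

If $\hat{X} \subset H_1$ then $\hat{X}_1 = \hat{X}$, $\nu_1 = 1$ and the theorem trivially holds. If $\hat{X}$ does not lie entirely in $H_1$ then

$\hat{X}_1 = \bigcup_{i=1}^{\nu_1} \hat{X}_{1i}$, $\dim \hat{X}_{1i} = d-2$, $i = 1,\dots,\nu_1$.

Then

$\hat{X}_1 \cap \bigcap_{j=2}^{d-1} H_j = \bigcup_{i=1}^{\nu_1} \left(\hat{X}_{1i} \cap \bigcap_{j=2}^{d-1} H_j\right)$

and for generic $H_2,\dots,H_{d-1}$, $\hat{X}_{1i} \cap \bigcap_{j=2}^{d-1} H_j$ will consist of $\deg \hat{X}_{1i}$ distinct points.

Since $\hat{X}_{1i} \ne \hat{X}_{1l}$ for $i \ne l$, for the generic hyperplanes $H_2,\dots,H_{d-1}$ the sets $\hat{X}_{1i} \cap \bigcap_{j=2}^{d-1} H_j$ and $\hat{X}_{1l} \cap \bigcap_{j=2}^{d-1} H_j$ must be distinct. Otherwise $\hat{X}_{1i} \cap \hat{X}_{1l}$ will contain a smooth manifold of dimension $d-2$ and thus $\hat{X}_{1i} = \hat{X}_{1l}$, as $\hat{X}_{1i}$ and $\hat{X}_{1l}$ are irreducible, which is impossible. Hence in case $\hat{X} \not\subset H_1$ we have

$\deg \hat{X} = \sum_{i=1}^{\nu_1} \deg \hat{X}_{1i}$.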
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Then 


Consider  now  X  n  n 


V  . 

X  n  n  =(  u  X,j^)n  H^. 
i=l 


Also 


X,  n  =  u  X, . . 
11  2  113 

1=1 


As  dim  X^^  >  d-2  we  have  dimX^^^  >  d-3.  According  to  what  we  proved 


above 


degXii  =  I  degX^^ 


j=1 


So 


(3.7) 


degX  =  I  ^  ^  degX^^.. 

i=,  i=1  j=l  ^ 


Also 


V  u.  « 

"2  = .  >.  ;  ^ir 

1=1  j=1 


Consider  the  decomposition  (3.4)  of  to  irreducible  factors.  Clearly 


X^  =  X..  .  for  some  i»j.  However  if  X,  c  X, .  .  then  X,  will  not 

111  ipq  -  111  ipq 


2a 


appear  in  the  irreducible  decomposition  of  X^.  Thus  the  equality  (3.7)  implies 
the  inequality  (3.6).  This  completes  the  proof  of  the  theorem  for  k=2.  The 
same  arguments  establishes  the  theorem  for  any  3  <  )t  <  d-1.  B 

Remark 3.8. It is easy to show that for generic hyperplanes $H_1,\dots,H_{d-1}$ the equality sign would hold in (3.5) and (3.6). Also, the degrees of all $X_{ki}$ must be the same.


Let $M_{n,m}$ be the set of $n \times m$ complex valued matrices. We identify $M_{n,m}$ with $C^{nm}$. For simplicity of notation we shall assume

(3.9)  $1 \le m \le n$.

For $1 \le r \le m \le n$ let $M_{n,m,r}$ be the set of matrices $A$ such that rank $A \le r$. That is

(3.10)  $M_{n,m,r} = \{A : |A[\alpha|\beta]| = 0,\ \alpha \in Q_{r+1,n},\ \beta \in Q_{r+1,m}\}$.

Hence $M_{n,m,r}$ is a homogeneous variety. Thus we can view $M_{n,m,r}$ as a projective variety in $P^{nm-1}$. We claim that $M_{n,m,r}$ is an irreducible variety. Let $M^0_{n,m,r}$ be the set of all $A \in M_{n,m}$ with rank $A = r$. Clearly this set is open in $M_{n,m,r}$. Also for $A \in M^0_{n,m,r}$

(3.11)  $A = P D_r Q$,  $P \in GL_n$,  $Q \in GL_m$,

where $D_r = (d_{ij})$, $d_{ii} = 1$, $i = 1,\dots,r$, and all other entries of $D_r$ vanish. As $GL_n$ and $GL_m$ are connected manifolds, $M^0_{n,m,r}$ is a connected manifold. So $M_{n,m,r}$ is irreducible. We claim

(3.12)  $\dim M_{n,m,r} = r(n+m-r)$  $(r \le m \le n)$.

Indeed, pick $A_0 \in M^0_{n,m,r}$. Assume for simplicity that the first $r$ rows of $A_0$ are linearly independent. Thus, if the first $r$ rows of $A$ stay in the neighborhood of the first $r$ rows of $A_0$ these rows will be linearly independent. This gives $rm$ independent parameters. Now, if $A \in M^0_{n,m,r}$ then any other row of $A$ is an arbitrary linear combination of the first $r$ rows of $A$. This gives us additional $r(n-r)$ parameters.
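The parameter count above can be illustrated by a small sketch. It is not part of the original argument: the helper names `rank` and `rank_r_matrix` are hypothetical, the base point $A_0$ is chosen with an identity block purely for convenience, and exact rational arithmetic is used so that the rank test is not affected by rounding.

```python
from fractions import Fraction
import random

def rank(rows):
    """Rank of a matrix (list of rows) by exact Gaussian elimination."""
    m = [[Fraction(x) for x in row] for row in rows]
    rk = 0
    for c in range(len(m[0])):
        pivot = next((i for i in range(rk, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for i in range(rk + 1, len(m)):
            f = m[i][c] / m[rk][c]
            m[i] = [a - f * b for a, b in zip(m[i], m[rk])]
        rk += 1
    return rk

def rank_r_matrix(n, m, r, seed=0):
    """Build an n x m matrix of rank r as in the proof of (3.12).

    The first r rows are independent (an identity block is an assumed,
    convenient choice); each remaining row is a linear combination of them.
    Returns the matrix and the parameter count r*m + r*(n-r).
    """
    rng = random.Random(seed)
    top = [[1 if j == i else 0 for j in range(r)]
           + [rng.randint(-5, 5) for _ in range(m - r)] for i in range(r)]
    coeffs = [[rng.randint(-5, 5) for _ in range(r)] for _ in range(n - r)]
    rest = [[sum(c[k] * top[k][j] for k in range(r)) for j in range(m)]
            for c in coeffs]
    n_params = r * m + r * (n - r)  # free entries of the top rows + coefficients
    return top + rest, n_params

A, k = rank_r_matrix(5, 4, 2)
print(rank(A), k, 2 * (5 + 4 - 2))
```

The two contributions $rm$ and $r(n-r)$ add up to $r(n+m-r)$, which is the identity the last line prints for $n=5$, $m=4$, $r=2$.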

So  we  proved  (3.12).  Next  we  claim 

Theorem 3.13. Let $M_{n,m,r}$ be the irreducible variety of all $n \times m$ matrices with the rank $r$ at most. Suppose that $1 \le r \le m \le n$. Then

(3.14)  $\deg M_{n,m,r} = (r+1)^{(m-r)(n-r)}$.

To prove this theorem as well as other results we need the following. Let $\theta : C^{\mu} \to C^{\nu}$ be a polynomial map. Denote by $\partial\theta(\zeta)$ the Jacobian of $\theta$ at the point $\zeta$. Let $\rho$ be the rank of $\partial\theta$. A point $\zeta \in C^{\mu}$ is called regular if rank $\partial\theta(\zeta) = \rho$. A point $\zeta$ is called singular or critical if rank $\partial\theta(\zeta) < \rho$.

Let $S$ be the set of all singular points of $\theta$. Clearly $S$ is an algebraic variety. So $C^{\mu} - S$, the set of regular points, is an open algebraic set in $C^{\mu}$. Therefore $\theta(C^{\mu} - S)$ is a connected manifold in $C^{\nu}$ of dimension $\rho$. Hence, $X = \mathrm{cl}\,\theta(C^{\mu})$ is an irreducible variety of dimension $\rho$ in $C^{\nu}$. Also $\theta(C^{\mu} - S)$ is an open algebraic set in $X$. So $X - \theta(C^{\mu} - S)$ is a closed subvariety in $X$. Thus, there exists a non-trivial polynomial $\kappa$ such that

(3.15)  $\kappa(\omega) = 0$

for any $\omega \in X - \theta(C^{\mu} - S)$. In particular any $\omega \in X - \theta(C^{\mu})$ ($\omega$ is an omitted value in $X$) must satisfy (3.15). Consider $Y = \mathrm{cl}\,\theta(S)$. It follows that $Y$ is a subvariety of $X$. So there exists another non-trivial polynomial $\kappa$ such that any critical value $\omega$ of $\theta$ must satisfy (3.15).

Let $\omega$ be a noncritical value, i.e. $\omega \in \theta(C^{\mu} - S)$. Then

(3.16)  $\theta^{-1}(\omega) = \bigcup_{j=1}^{m} Y_j$,

where each $Y_j$ is an irreducible variety of the dimension $\mu - \mathrm{rank}\,\partial\theta$. The number $m$ is independent of the point $\omega$ and is called the degree of the map, $\deg\theta$. The continuity argument implies that whenever we have the decomposition (3.16) to irreducible varieties then $m \le \deg\theta$. The map $\theta$ is called regular if

rank $\partial\theta = \min(\mu,\nu)$.


The arguments above show

Theorem 3.17. Let $\theta : C^{\mu} \to C^{\nu}$ be a polynomial map. Let the degree of $\theta$, $\deg\theta$, be defined as above. Consider the decomposition (3.16) of $\theta^{-1}(\omega)$ to its irreducible varieties. Then $m \le \deg\theta$. If $\omega \in \theta(C^{\mu})$ is not a critical value then $m = \deg\theta$ and $\dim Y_i = \mu - \mathrm{rank}\,\partial\theta$, $i = 1,\dots,\deg\theta$. Assume that $\mu \ge \nu$ and suppose that $\theta$ is a regular map. Then there exists a non-trivial polynomial $\kappa$ such that any omitted value $\omega \in C^{\nu} - \theta(C^{\mu})$ must satisfy (3.15). Assume furthermore that $\mu = \nu$. Then any non-critical value $\omega \in C^{\nu}$ is obtained exactly $\deg\theta$ times.

Note that if $\theta$ is an onto map then $\kappa(\omega)$ can be chosen to be $\equiv 1$. The most interesting case is when $\mu = \nu$. In that case there are simple sufficient conditions for $\theta$ to be an onto map. Let $p(\zeta)$ be a polynomial in $\mu$ variables. Denote by $\deg p$ the degree of $p$. Define

(3.18)  $p_\pi(\zeta) = \lim_{t \to \infty} p(t\zeta)/t^{\deg p}$,  $p_\pi(\zeta) = 0$ if $p$ is a constant.

Put

(3.19)  $\theta_\pi = (\theta_{1\pi},\dots,\theta_{\nu\pi})$,  $\deg\theta = \prod_{i=1}^{\nu} \deg\theta_i$.
Theorem 3.20. Let $\theta : C^{\mu} \to C^{\mu}$ be a polynomial map. Assume that the system

(3.21)  $\theta_\pi(\zeta) = 0$

has the only solution $\zeta = 0$. Then the system

(3.22)  $\theta(\zeta) = \omega$

is always solvable. The number of distinct solutions of (3.22) is at most $\deg\theta$. Moreover, there exists a non-trivial polynomial $\kappa(\omega)$, $\omega \in C^{\mu}$, such that the equation (3.22) has exactly $\deg\theta$ distinct solutions unless $\omega$ satisfies (3.15). That is, $\theta$ is a $\deg\theta$ covering of $C^{\mu}$.
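As an illustration of Theorem 3.20 (not taken from the original text), consider the assumed example map $\theta(x,y) = (x^2+y,\ y^2+x)$ on $C^2$. Its principal part $(x^2, y^2)$ vanishes only at the origin and $\deg\theta = 4$. Eliminating $y = a - x^2$ from $\theta(x,y) = (a,b)$ leaves the quartic $x^4 - 2ax^2 + x + a^2 - b = 0$, and each root $x$ determines $y$ uniquely, so counting distinct solutions reduces to counting distinct roots of one polynomial. The sketch below does this exactly through $\gcd(f, f')$; all helper names are hypothetical.

```python
from fractions import Fraction

def trim(p):
    """Drop trailing zero coefficients (polynomials are stored low degree first)."""
    p = list(p)
    while p and p[-1] == 0:
        p.pop()
    return p

def derivative(p):
    return trim([i * c for i, c in enumerate(p)][1:])

def poly_mod(a, b):
    """Remainder of a modulo a nonzero polynomial b, over the rationals."""
    a, b = trim(a), trim(b)
    while len(a) >= len(b):
        f = a[-1] / b[-1]
        shift = len(a) - len(b)
        a = trim([c - f * b[i - shift] if i >= shift else c
                  for i, c in enumerate(a)])
    return a

def poly_gcd(a, b):
    a, b = trim(a), trim(b)
    while b:
        a, b = b, poly_mod(a, b)
    return a

def distinct_roots(p):
    """Number of distinct complex roots: deg p - deg gcd(p, p')."""
    p = trim([Fraction(c) for c in p])
    g = poly_gcd(p, derivative(p))
    return (len(p) - 1) - (len(g) - 1)

def solutions_count(a, b):
    """Distinct solutions of (x^2 + y, y^2 + x) = (a, b)."""
    quartic = [a * a - b, 1, -2 * a, 0, 1]  # x^4 - 2a x^2 + x + a^2 - b
    return distinct_roots(quartic)

print(solutions_count(1, 2))                              # a generic value
print(solutions_count(Fraction(-1, 4), Fraction(-1, 4)))  # a critical value
```

For the value $(1,2)$ the quartic has nonzero discriminant, so all $\deg\theta = 4$ solutions are distinct. The value $(-1/4,-1/4)$ is the image of the critical point $(-1/2,-1/2)$, where the Jacobian determinant $4xy - 1$ vanishes, and only three distinct solutions remain.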

Remark 3.23. This theorem is essentially due to Noether and van der Waerden [1928]. It was rediscovered by us in Friedland [1977] (Theorem 2.1).

Although Theorem 2.1 is stated in a slightly different form we did show in the proof of Theorem 2.1 that the equation (3.22) has exactly $\deg\theta$ distinct solutions unless $\omega$ is a critical value. In that case the arguments preceding Theorem 3.17 imply that $\omega$ must satisfy a non-trivial equation (3.15). In fact $\kappa(\omega)$ is non-constant if $\deg\theta > 1$.

Proof of Theorem 3.13. Consider generic hyperplanes $H_1,\dots,H_p$, where $p = r(n+m-r)-1$, in the projective space $P^{nm-1}$. So $M_{n,m,r} \cap H_1 \cap \cdots \cap H_p$ consists of $q = \deg M_{n,m,r}$ points $A_1,\dots,A_q$. As the hyperplanes vary, the points $\{A_1,\dots,A_q\}$ sweep an open set in $M_{n,m,r}$. So we may assume that $H_1,\dots,H_p$ were chosen such that

(i) $A_1,\dots,A_q$ are pairwise distinct;

(ii) the $r \times r$ minors composed of the first $r$ rows and columns of $A_1,\dots,A_q$ are distinct from zero. Also the (1,1) entry of each $A_i$ is different from zero. As each $A_i$ lies in the projective space $P^{nm-1}$ we may assume that the (1,1) entry of each $A_i$ is equal to 1.

Then $A_1,\dots,A_q$ are the solutions of the following $mn$ polynomial equations whose variables are the entries of $X = (x_{ij})$, $i = 1,\dots,n$, $j = 1,\dots,m$.

(I) all $(r+1) \times (r+1)$ minors of $X$ which include the first $r$ rows and columns of $X$ are equal to zero.

(II) $H_j(X) = 0$, $j = 1,\dots,p$.

(III) $x_{11} = 1$.

The set of equations (I) consists of $(m-r)(n-r)$ homogeneous equations of degree $r+1$. Sets II + III consist of $r(n+m-r)$ linear equations. So we have exactly $mn$ equations. The left hand side of I, II and III defines a map $\theta : C^{mn} \to C^{mn}$ of the degree $\delta = (r+1)^{(m-r)(n-r)}$. Next we show that the system (3.21) has the only solution $X = 0$. Otherwise at least one $A_i$ will have (1,1) zero entry, which contradicts our assumption.

Theorem 3.20 yields that $q \le \delta$. It is left to show that we have the equality sign $q = \delta$. As $M_{n,m,r}$ is an irreducible variety such that $M^0_{n,m,r}$ is open in $M_{n,m,r}$ it follows that $p$ generic hyperplanes $H_1,\dots,H_p$ will intersect $M^0_{n,m,r}$ transversally. That is, we may assume that $\theta$ is regular at the points $A_1,\dots,A_q$. In that case Remark 3.23 implies that $q = \delta$ and the theorem is proved. ■

In $M_{(m+1)n^2,n^2}$ consider a subspace $\mathcal{L}$ of matrices of the form $L(X_0,\dots,X_m,\lambda A_0,\dots,\lambda A_m)$ where $X_0,\dots,X_m$ are arbitrary $n \times n$ matrices and $\lambda$ is a complex parameter. Let

(3.24)  $Y = M_{(m+1)n^2,n^2,r} \cap \mathcal{L}$,  $Y = \bigcup_{j=1}^{\eta} Y_j$,

where each $Y_j$ is an irreducible homogeneous variety. According to Theorem 3.13

$\deg M_{(m+1)n^2,n^2,r} = (r+1)^{[(m+1)n^2-r][n^2-r]} = \kappa(n,m,r)$.

Next Theorem 3.3 yields

(3.25)  $\eta \le \sum_{j=1}^{\eta} \deg Y_j \le \kappa(n,m,r)$.

Let $X$ be the variety given by (1.9). Assume that (1.10) is its decomposition to irreducible varieties. Clearly each irreducible $X_i$ is obtained by restricting some $Y_j$ to the hyperplane $H$ of matrices of the form $L(X_0,\dots,X_m,A_0,\dots,A_m)$. Since some $Y_j$ may have an empty intersection with $H$ we have the inequality $\kappa \le \eta$.

This establishes (1.11). More precisely, Theorem 3.3 implies

(3.26)  $\sum_{i=1}^{\kappa} \deg X_i \le \kappa(n,m,r)$.

Thus if we can compute the degree of $\mathrm{orb}(A_0,\dots,A_m)$ we could probably improve the inequality (1.11) by means of the above inequality.

4. Algebraic functions.

Let $p(\lambda,x)$ be a polynomial of the form

(4.1)  $p(\lambda,x) = \lambda^n + \sum_{i=1}^{n} p_i(x)\lambda^{n-i}$.

As usual let $C[\lambda,x]$ be the ring of polynomials in $\lambda$ and $x$. As $C[\lambda,x]$ is a UFD (unique factorization domain), $p(\lambda,x)$ decomposes to irreducible factors

(4.2)  $p(\lambda,x) = \prod_{i=1}^{k} q_i(\lambda,x)^{\mu_i}$,

where each $q_i(\lambda,x)$ is irreducible and $q_i$ and $q_j$ are coprime for $i \ne j$. Moreover, since $p(\lambda,x)$ is monic in $\lambda$, each $q_i(\lambda,x)$ can be assumed to be monic in $\lambda$, of degree 1 at least in the $\lambda$ variable. For this property and others consult for example Whitney [1972]. We call $p(\lambda,x)$ degenerated if in the decomposition (4.2) some factor is repeated twice at least. That is, $\mu_i > 1$ for some $i$. In what follows we consider the roots of $p(\lambda,x)$

(4.3)  $p(\lambda,x) = 0$.

Then each root $\lambda(x)$ is an algebraic function of $x$. We can name these roots by $\lambda_1(x),\dots,\lambda_n(x)$. Clearly there are two identical roots $\lambda_i(x)$ and $\lambda_j(x)$ if and only if $p(\lambda,x)$ is degenerated. For $\lambda_i(x)$ a point $\zeta$ is called a regular point (point of analyticity) if $\lambda_i(x)$ is analytic in the neighborhood of this point. Otherwise $\zeta$ is called a branch (singular) point. Let $\zeta$ be a branch point. Then it is possible to group the eigenvalues as follows

(4.4)  $\{\lambda_1(x),\dots,\lambda_{m_1}(x)\},\ \{\lambda_{m_1+1}(x),\dots,\lambda_{m_1+m_2}(x)\},\ \dots,\ \{\lambda_{n-m_\kappa+1}(x),\dots,\lambda_n(x)\}$

such that, when we circle once on a small circle around $\zeta$, each group of eigenvalues undergoes a cyclic permutation. For brevity each group will be called a cycle at $\zeta$ and the number of elements of a cycle will be called its period. So if the period of a given cycle is one then the corresponding $\lambda_i(x)$ is analytic in the neighborhood of $\zeta$. Thus $\zeta$ is a branch point if there exists a cycle of eigenvalues with the period greater than one. Assume that $\{\lambda_1(x),\dots,\lambda_m(x)\}$ form a cycle. Then we have the Puiseux series

(4.5)  $\lambda_h(x) = \sum_{j=0}^{\infty} \mu_j\,\omega^{hj}(x-\zeta)^{j/m}$,  $h = 1,\dots,m$,

where $\omega = e^{2\pi i/m}$. See for example Whitney [1972, p. 32].

It is well known that the equation $p(\lambda,x) = 0$ has a finite number of branch points $\zeta_1,\dots,\zeta_b$. So each $\lambda_i(x)$ is a multivalued analytic function on

(4.6)  $D = C - \{\zeta_1,\dots,\zeta_b\}$.

Suppose that $p(\lambda,x)$ is irreducible. Then each $\lambda_i(x)$ is $n$ valued. In other words each $\lambda_i(x)$ is analytic and univalued on an $n$-sheeted cover of $D$. More precisely, starting from one branch of $\lambda_i(x)$ it is possible to recover all other branches of $\lambda(x)$ on $D$ by analytic continuation. Using the decomposition (4.2) we can find how many branches each $\lambda_i(x)$ has. In particular, if all the eigenvalues $\lambda_i(x)$, $i = 2,\dots,n$, can be generated from $\lambda_1(x)$ then $p(\lambda,x)$ must be irreducible. Next we study what happens when $x \to \infty$. Assume that $p(\lambda,x)$ is of the form (4.1). Define

(4.7)  $\delta = \max_{1 \le i \le n} \deg p_i / i$.

Divide the equation (4.1) by $x^{n\delta}$ to deduce that

(4.8)  $|\lambda_i(x)| \le K|x|^{\delta}$ for $|x| \ge r$.

So at $x = \infty$, i.e. $|x| > r$ for some large $r$, we divide the eigenvalues to the cycles of the form (4.4). Now the group $\lambda_1,\dots,\lambda_m$ must have the expansion

(4.9)  $\lambda_h(x) = \sum_{j=0}^{\infty} \nu_j\,(\omega^h x^{1/m})^{\alpha-j}$,  $0 \le \alpha \le m\delta$.

The case $\alpha = 0$ can only correspond to the case where $m = 1$ and $\lambda_i(x)$ is a constant function.

Lemma 4.10. Let $q(\lambda,x)$ be of the form (4.1)

(4.11)  $q(\lambda,x) = \lambda^m + \sum_{i=1}^{m} q_i(x)\lambda^{m-i}$.

Assume that $q(\lambda,x)$ divides $p(\lambda,x)$. Then

(4.12)  $\deg q_i(x) \le i\delta$,  $i = 1,\dots,m$.

Proof. It is possible to rename the roots of $p(\lambda,x) = 0$ such that

(4.13)  $q(\lambda,x) = \prod_{i=1}^{m} (\lambda - \lambda_i(x))$.

According to (4.9), $|q_i(x)| \le K|x|^{i\delta}$ for $|x|$ big enough. This proves (4.12). ■
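A small numerical check of Lemma 4.10, under assumed example data: take $p(\lambda,x) = (\lambda^2 - x^3)(\lambda - x) = \lambda^3 - x\lambda^2 - x^3\lambda + x^4$, so that $\deg p_1 = 1$, $\deg p_2 = 3$, $\deg p_3 = 4$. The function names below are hypothetical; a coefficient that vanishes identically is recorded as `None` and imposes no constraint.

```python
from fractions import Fraction

def delta(coeff_degrees):
    """delta = max_i deg(p_i)/i as in (4.7); coeff_degrees[i-1] = deg p_i or None."""
    return max((Fraction(d, i)
                for i, d in enumerate(coeff_degrees, start=1) if d is not None),
               default=Fraction(0))

def satisfies_lemma(factor_degrees, d):
    """Check deg q_i <= i*delta for every nonzero coefficient q_i of a factor."""
    return all(deg is None or deg <= i * d
               for i, deg in enumerate(factor_degrees, start=1))

d = delta([1, 3, 4])                 # p = l^3 - x l^2 - x^3 l + x^4
print(d)                             # 3/2
print(satisfies_lemma([None, 3], d)) # factor l^2 - x^3
print(satisfies_lemma([1], d))       # factor l - x
```

Here $\delta = 3/2$ is attained by $p_2$, and both monic factors respect the bound (4.12): $\deg q_2 = 3 \le 2\delta$ for the quadratic factor and $\deg q_1 = 1 \le \delta$ for the linear one.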

Definition 4.14. A point $\zeta$ is called an intersection point if either $\zeta$ is a branch point or there exist two distinct eigenvalues $\lambda_i(x)$ and $\lambda_j(x)$ (in the neighborhood of $\zeta$) analytic at $\zeta$ such that $\lambda_i(\zeta) = \lambda_j(\zeta)$ if $\zeta$ is finite. (In case that $\zeta = \infty$ we demand that $\lim_{x \to \infty} \lambda_i(x)/\lambda_j(x) = 1$.)

Consider for example

(4.15)  $p(\lambda,x) = \prod_{i=1}^{n} [\lambda - (a_i + b_i x^r)]$,  $r$ is a positive integer.

Then we do not have any branch points but there are intersection points

(4.16)  $\zeta_{ij} = [(a_i - a_j)/(b_j - b_i)]^{1/r}$,  $1 \le i < j \le n$,

if $(a_i,b_i) \ne (a_j,b_j)$.
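The example (4.15) can be made concrete with a short sketch, restricted to $r = 1$ so that the intersection points are rational and can be computed exactly. The data $a = (0,1,3)$, $b = (1,3,2)$ and the function name are illustrative assumptions, and indices are zero-based in the code.

```python
from fractions import Fraction
from itertools import combinations

def intersection_points(a, b):
    """Finite points where the branches a_i + b_i*x and a_j + b_j*x meet (r = 1)."""
    pts = {}
    for i, j in combinations(range(len(a)), 2):
        if b[i] != b[j]:  # parallel branches never meet at a finite point
            pts[(i, j)] = Fraction(a[i] - a[j], b[j] - b[i])
    return pts

a, b = [0, 1, 3], [1, 3, 2]
pts = intersection_points(a, b)
for (i, j), z in pts.items():
    # both branches take the same value at the intersection point
    print((i, j), z, a[i] + b[i] * z == a[j] + b[j] * z)
```

For this choice the three pairs give three distinct points, which is the maximal number $n(n-1)/2$ for $n = 3$ and $r = 1$.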

Suppose that $p(\lambda,x)$ is not degenerate. In order to find the intersection points we consider

(4.17)  $D(p) = \prod_{1 \le i < j \le n} (\lambda_i(x) - \lambda_j(x))^2 = D(x)$.

Since $p(\lambda,x)$ is monic in $\lambda$ it is well known that $D(p)$ is a polynomial in $p_1,\dots,p_n$. The expansion (4.9) implies that

(4.18)  $\deg D(x) \le n(n-1)\delta$.

Clearly $D(x) \equiv 0$ if and only if $p(\lambda,x)$ is degenerate. Thus if $p(\lambda,x)$ is nondegenerate then a finite point $\zeta$ is an intersection point if and only if $D(\zeta) = 0$. We now study the connection between the multiplicity of the root $x = \zeta$ in $D(x)$ and the nature of the intersection point $\zeta$.


Theorem 4.19. Let $\zeta$ be a finite intersection point. Suppose that $\lambda_1(x),\dots,\lambda_n(x)$ break up to the cycles as given by (4.4). Assume furthermore that
X,(C)  =  •••  =  X  (i)  *  \  (C)  =  •••  =  X  (C) 


(4.20) 


’  V  ■  —  -  V  ">■  “e-  “• 

^9-1  ^9 

Then $x = \zeta$ is a root of $D(x)$ at least of multiplicity

(4.21)  $L = \sum_{i=1}^{\kappa} (m_i - 1) + 2 \sum_{p=0}^{\theta-1}\ \sum_{k_p+1 \le i < j \le k_{p+1}} \min(m_i,m_j)$,  $k_0 = 0$, $k_\theta = \kappa$.

Proof. Assume that $\lambda_1(x),\dots,\lambda_m(x)$ is a cycle of period $m$ at $x = \zeta$. Consider the product

$f(x) = \prod_{1 \le i < j \le m} (\lambda_i(x) - \lambda_j(x))^2$.

By completing one circle on $|x - \zeta| = r$ we permute $\lambda_1,\dots,\lambda_m$ cyclically, so $f(x)$ is analytic and univalued in $|x - \zeta| < r$. Each $\lambda_i(x)$ has the expansion (4.5). So $\lambda_i(x) - \lambda_j(x)$ is divided by $(x-\zeta)^{1/m}$. Thus $f(x)$ is divided by $(x-\zeta)^{m-1}$. This shows that $D(x)$ is divided by $(x-\zeta)^M$, $M = \sum_{i=1}^{\kappa} (m_i - 1)$.

Assume now that $\lambda_{m+1}(x),\dots,\lambda_{m+\ell}(x)$ is another cycle of period $\ell$ such that $\lambda_i(\zeta) = \lambda_j(\zeta)$, $i,j = 1,\dots,m+\ell$.

Consider the function

$g(x) = \prod_{1 \le i \le m,\ m+1 \le j \le m+\ell} (\lambda_i(x) - \lambda_j(x))$.

By circling around the point $\zeta$ we permute the branches $\lambda_1,\dots,\lambda_m$ and $\lambda_{m+1},\dots,\lambda_{m+\ell}$. So $g(x)$ remains univalued in the neighborhood of $\zeta$. That is, $g(x)$ is analytic for $|x - \zeta| < r$. Without loss of generality we may assume that $m \ge \ell$. Then $g(x)$ is divided by $[(x-\zeta)^{1/m}]^{m\ell} = (x-\zeta)^{\ell}$. Thus $g^2(x)$ is divided by $(x-\zeta)^{2\ell}$. So altogether $D(x)$ is divided by $(x-\zeta)^L$ where $L$ is given by (4.21). ■

Theorem 4.22. Let $\zeta$ be a finite intersection point. Assume furthermore that $\zeta$ is a simple root of $D(x)$. Then $\zeta$ is a branch point of the algebraic function $p(\lambda,x) = 0$ and the equation $p(\lambda,\zeta) = 0$ has exactly $n-1$ distinct roots. That is, at $x = \zeta$ one has exactly one cycle $\{\lambda_1(x),\lambda_2(x)\}$ of period two while all other branches $\lambda_3(x),\dots,\lambda_n(x)$ are analytic in the neighborhood of $\zeta$. Moreover in the expansion (4.5) $\mu_1 \ne 0$ $(m=2)$.

Proof. Our assumptions imply that $L$ given by (4.21) is equal to 1. So $k_i = i$, $i = 0,\dots,\kappa$, and $m_1 = 2$, $m_i = 1$, $i = 2,\dots,\kappa$. Thus $\zeta$ is a branch point and $p(\lambda,\zeta) = 0$ has exactly $n-1$ distinct roots. Furthermore

(4.23)  $(\lambda_1(x) - \lambda_2(x))^2 = 4(x-\zeta)\bigl(\sum_{j=0}^{\infty} \mu_{2j+1}(x-\zeta)^j\bigr)^2$.

Thus if $\mu_1 = 0$ then $x = \zeta$ is a zero of order 3 at least of $D(x)$. Therefore $\mu_1 \ne 0$. ■

Theorem 4.22 shows that the simple zeros of $D(x)$ describe the simplest possible branch points of $p(\lambda,x) = 0$. Theorem 4.19 enables us to analyze the double roots of $D(x)$.
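Three standard examples, chosen here for illustration and not taken from the original, show how the order of vanishing of $D(x)$ at $\zeta = 0$ reflects the local behaviour: $\lambda^2 - x$ (a cycle of period two), $\lambda^2 - x^2$ (two analytic branches crossing) and $\lambda^3 - x$ (a cycle of period three). The sketch uses the classical discriminant formulas $p_1^2 - 4p_2$ for $\lambda^2 + p_1\lambda + p_2$ and $-4p^3 - 27q^2$ for $\lambda^3 + p\lambda + q$; polynomials in $x$ are stored as coefficient lists, lowest degree first, and all function names are hypothetical.

```python
def padd(a, b):
    n = max(len(a), len(b))
    return [(a[i] if i < len(a) else 0) + (b[i] if i < len(b) else 0)
            for i in range(n)]

def pscale(c, a):
    return [c * x for x in a]

def pmul(a, b):
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def disc_quadratic(p1, p2):
    """D(x) for lambda^2 + p1(x)*lambda + p2(x)."""
    return padd(pmul(p1, p1), pscale(-4, p2))

def disc_depressed_cubic(p, q):
    """D(x) for lambda^3 + p(x)*lambda + q(x)."""
    return padd(pscale(-4, pmul(p, pmul(p, p))), pscale(-27, pmul(q, q)))

def order_at_zero(d):
    """Multiplicity of x = 0 as a root of d (None if d is identically zero)."""
    return next((k for k, c in enumerate(d) if c != 0), None)

print(order_at_zero(disc_quadratic([0], [0, -1])))        # lambda^2 - x
print(order_at_zero(disc_quadratic([0], [0, 0, -1])))     # lambda^2 - x^2
print(order_at_zero(disc_depressed_cubic([0], [0, -1])))  # lambda^3 - x
```

The first example has $D(x) = 4x$, a simple root with a cycle of period two, as in Theorem 4.22. The other two have double roots, $D(x) = 4x^2$ and $D(x) = -27x^2$: in one case the branches $\pm x$ are analytic with different derivatives, in the other the three branches form a single cycle, and both orders agree with the lower bound (4.21).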

Theorem 4.24. Let $\zeta$ be a finite intersection point. Assume furthermore that $\zeta$ is a double root of $D(x)$. Then one of the following conditions holds.

(i) $p(\lambda,\zeta) = 0$ has $n-1$ distinct roots, i.e.

$\lambda_1(\zeta) = \lambda_2(\zeta)$,  $\lambda_i(\zeta) \ne \lambda_j(\zeta)$ for $2 \le i < j \le n$.

In that case each $\lambda_i(x)$ is analytic in the neighborhood of $\zeta$ and $\lambda_1'(\zeta) \ne \lambda_2'(\zeta)$.

(ii) $p(\lambda,\zeta) = 0$ has $n-2$ distinct roots and one of them is triple, i.e.

$\lambda_1(\zeta) = \lambda_2(\zeta) = \lambda_3(\zeta)$,  $\lambda_i(\zeta) \ne \lambda_j(\zeta)$ for $3 \le i < j \le n$.

In that case $x = \zeta$ is a branch point with exactly one cycle $\{\lambda_1(x),\lambda_2(x),\lambda_3(x)\}$, while all other branches $\lambda_4(x),\dots,\lambda_n(x)$ are analytic. Moreover in the expansion (4.5) $\mu_1 \ne 0$ $(m=3)$.

(iii) $p(\lambda,\zeta) = 0$ has $n-2$ distinct roots and two of them are double, i.e.

$\lambda_1(\zeta) = \lambda_2(\zeta)$,  $\lambda_{n-1}(\zeta) = \lambda_n(\zeta)$,  $\lambda_i(\zeta) \ne \lambda_j(\zeta)$ for $2 \le i < j \le n-1$.

In that case $x = \zeta$ is a branch point with exactly two cycles $\{\lambda_1(x),\lambda_2(x)\}$, $\{\lambda_{n-1}(x),\lambda_n(x)\}$ while all other branches $\lambda_3(x),\dots,\lambda_{n-2}(x)$ are analytic. Moreover, in the expansions (4.5) for $\{\lambda_1(x),\lambda_2(x)\}$ and $\{\lambda_{n-1}(x),\lambda_n(x)\}$, $\mu_1 \ne 0$.

The proof of Theorem 4.24 is quite analogous to the proof of Theorem 4.22 and uses only Theorem 4.19. So we omit its proof.

5. Special polynomials in two variables.

In this section we consider polynomials $p(\lambda,x)$ of the form

(5.1)  $p(\lambda,x) = \lambda^n + \sum_{i=1}^{n} p_i(x)\lambda^{n-i}$,  $p_i(x) = \sum_{j=0}^{ir} p_{ij}x^j$,  $i = 1,\dots,n$.

Here $r$ is a positive integer. Such a polynomial $p(\lambda,x)$ is uniquely determined by the coefficient vector

(5.2)  $p = (p_{10},p_{11},\dots,p_{1r},\dots,p_{n0},\dots,p_{n(nr)})$.

We shall identify $p(\lambda,x)$ with its coefficient vector and no ambiguity will arise. Thus any polynomial $p(\lambda,x)$ is given by a point $p$ in

$P^{n,r} = C^{[(n+1)r+2]n/2}$.

We next consider the algebraic function $\lambda(x)$ given by the equation $p(\lambda,x) = 0$. In that case $\delta$ defined by (4.7) is at most $r$. So each $\lambda(x)$ has the Puiseux expansion

(5.3)  $\lambda(x) = x^r \sum_{j=0}^{\infty} \nu_j x^{-j/m}$

around $\zeta = \infty$. Here $\nu_0$ is a root of the equation

(5.4)  $\nu^n + \sum_{i=1}^{n} p_{i(ir)}\nu^{n-i} = 0$.

Thus $\zeta = \infty$ is an intersection point if (5.4) has at least one double root. Let $D(x)$ be given by (4.17). According to (4.18) the degree of $D(x)$ is at most $n(n-1)r$.

So

(5.5)  $D(x) = \sum_{i=0}^{rn(n-1)} d_i(p)\,x^{rn(n-1)-i}$,

where each $d_i(p)$ is a polynomial in the coefficient vector $p$. In particular

(5.6)  $d_0(p) = \prod_{1 \le i < j \le n} (\nu_i - \nu_j)^2$,

where $\nu_1,\dots,\nu_n$ are the roots of (5.4). So $d_0(p)$ is a polynomial in $p_{1r},\dots,p_{n(nr)}$, which is called the discriminant of

(5.7)  $p_\pi(\lambda) = \lambda^n + \sum_{i=1}^{n} p_{i(ir)}\lambda^{n-i}$.

Thus for all $p \in P^{n,r}$ such that $d_0(p) \ne 0$ the polynomial $D(x)$ is exactly of degree $rn(n-1)$. Let $p(\lambda,x)$ be given by (4.15). Then

(5.8)  $D(x) = \prod_{1 \le i < j \le n} [(b_i - b_j)x^r + (a_i - a_j)]^2$.

Suppose that $b_i \ne b_j$ and $a_i \ne a_j$ for $i \ne j$. Then $D(x)$ is a polynomial of degree $rn(n-1)$. Clearly we can choose $(a_i,b_i)$, $i = 1,\dots,n$, such that we would have exactly $rn(n-1)/2$ distinct intersection points of the form (4.16). In that case each $\zeta_{ij}$ is a double root of $D(x)$. We now show that there exist $p(\lambda,x)$ of the form (5.1) for which the polynomial $D(x)$ has $rn(n-1)$ simple (distinct) roots. Let $\delta(p)$ be the discriminant of the polynomial $D(x)$. That is, $\delta(p)$ is given by the well known determinantal formula, where we write $N = rn(n-1)$ and $d_i = d_i(p)$,

(5.9)  $\delta(p) = \begin{vmatrix} d_0 & d_1 & \cdots & d_N & & \\ & \ddots & & & \ddots & \\ & & d_0 & d_1 & \cdots & d_N \\ N d_0 & (N-1)d_1 & \cdots & d_{N-1} & & \\ & \ddots & & & \ddots & \\ & & N d_0 & (N-1)d_1 & \cdots & d_{N-1} \end{vmatrix}$

(e.g. Whitney [1972, Appendix IV]).

We claim

Theorem 5.10. Let $\delta(p)$ be a polynomial on $P^{n,r}$ given by (5.9). Then $\delta(p)$ does not vanish identically on $P^{n,r}$. In particular, if $\delta(p) \ne 0$ then $D(x)$ is a polynomial of degree $rn(n-1)$ having $rn(n-1)$ simple roots.

Proof. Let $p$ be a polynomial of the form (4.15) such that $b_i \ne b_j$, $a_i \ne a_j$ for $i \ne j$ and $D(x)$ has exactly $rn(n-1)/2$ double roots of the form (4.16).

Let $q \in P^{n,r}$ be in the neighborhood of $p$. So $D(x,q)$ is a polynomial of degree $rn(n-1)$ with the roots $\zeta_1(q),\dots,\zeta_{rn(n-1)}(q)$ which continuously depend on $q$. For $q = p$

$\zeta_{2i-1}(p) = \zeta_{2i}(p)$, $i = 1,\dots,rn(n-1)/2$,  $\zeta_{2i}(p) \ne \zeta_{2j}(p)$ for $1 \le i < j \le rn(n-1)/2$.

So either we can find $q$ such that $\zeta_i(q) \ne \zeta_j(q)$ for $1 \le i < j \le rn(n-1)$ or we must have $\zeta_{2i-1}(q) = \zeta_{2i}(q)$ for some $i$ and all $q$ (renaming the indices if necessary). So suppose that $\zeta_1(q) = \zeta_2(q)$ and $\zeta_1(p) = \zeta_2(p) = [(a_1-a_2)/(b_2-b_1)]^{1/r} \ne 0$. Choose

$q(\lambda,x) = \{[\lambda - (a_1+b_1x^r)][\lambda - (a_2+b_2x^r)] + \varepsilon\} \prod_{3 \le i \le n} [\lambda - (a_i+b_ix^r)]$

$= \{\lambda^2 - [a_1+a_2+(b_1+b_2)x^r]\lambda + (a_1+b_1x^r)(a_2+b_2x^r) + \varepsilon\} \prod_{3 \le i \le n} [\lambda - (a_i+b_ix^r)]$,

where $\varepsilon$ is a small parameter. Then $\zeta_1(q)$ and $\zeta_2(q)$ must be the roots of the equation

$[(a_1+a_2) + (b_1+b_2)x^r]^2 = 4[(a_1+b_1x^r)(a_2+b_2x^r) + \varepsilon]$

which is equivalent to

$(b_1-b_2)^2x^{2r} + 2(b_1-b_2)(a_1-a_2)x^r + (a_1-a_2)^2 - 4\varepsilon = 0$.

But for $\varepsilon \ne 0$ all $2r$ roots of the above equation are distinct. This contradicts our assumption that $\zeta_1(q) = \zeta_2(q)$. Thus we proved our assertion that there exist $q \in P^{n,r}$ in the neighborhood of a given $p$ such that $D(x,q)$ has $rn(n-1)$ distinct roots. ■

We now claim that if $\delta(p) \ne 0$ then $p(\lambda,x)$ is an irreducible polynomial. This follows from the following theorem.

Theorem 5.11. Let $p(\lambda,x)$ be a non-degenerate reducible polynomial of the form (5.1). Suppose that

(5.12)  $p(\lambda,x) = q_1(\lambda,x)q_2(\lambda,x)$,  $\deg q_i(\lambda,1) = n_i \ge 1$,  $q_i = \lambda^{n_i} + \sum_{j=1}^{n_i} q_{ij}(x)\lambda^{n_i-j}$,  $i = 1,2$.

Then $D(x)$ has at most $r[n(n-1) - n_1n_2]$ distinct roots and this number is achievable. In particular any reducible polynomial $p(\lambda,x)$ has at most $r(n-1)^2$ distinct intersection points.

Proof. The equation $p(\lambda,x) = 0$ splits to $q_1(\alpha,x) = 0$, $q_2(\beta,x) = 0$. Thus, if $\zeta$ is a finite intersection point for $\lambda(x)$ then one of the following conditions holds:

(i) $\alpha_i(\zeta) = \alpha_j(\zeta)$ for $i \ne j$,

(ii) $\beta_i(\zeta) = \beta_j(\zeta)$ for $i \ne j$,

(iii) $\alpha_i(\zeta) = \beta_j(\zeta)$.

The possibility (i) means that $\zeta$ is an intersection point of $q_1(\lambda,x)$. Therefore we can have at most $rn_1(n_1-1)$ intersection points. In the same way possibility (ii) can happen for at most $rn_2(n_2-1)$ distinct points. To find out how many distinct $\zeta$ may satisfy (iii) we look at the resultant of $q_1(\lambda,x)$ and $q_2(\lambda,x)$,

$R(x) = \prod_{i,j} (\alpha_i(x) - \beta_j(x))$.

Since $q_1(\lambda,x)$ and $q_2(\lambda,x)$ are monic it is well known that $R(x)$ is a polynomial in the coefficients of $\lambda$ in $q_1(\lambda,x)$ and $q_2(\lambda,x)$. So $R(x)$ is a polynomial in $x$. The expansion (5.3) at infinity yields that

$|R(x)| \le K|x|^{rn_1n_2}$.

So the degree of $R$ is at most $rn_1n_2$. Therefore there are at most $rn_1n_2$ distinct $\zeta$ satisfying (iii). Altogether we get that $D(x,p)$ has at most the following number of distinct zeros

$r[n_1(n_1-1) + n_2(n_2-1) + n_1n_2] = r[(n_1+n_2)(n_1+n_2-1) - n_1n_2] = r[n(n-1) - n_1n_2]$.

Let $p(\lambda,x)$ be of the form (4.15) such that $D(x)$ has exactly $rn(n-1)/2$ double roots. Define

$p_1(\lambda,x) = \prod_{i=1}^{n_1} [\lambda - (a_i+b_ix^r)]$,  $p_2(\lambda,x) = \prod_{i=n_1+1}^{n} [\lambda - (a_i+b_ix^r)]$.

Now we can find $q_i$ in the neighborhood of $p_i$ such that $\delta(q_i) \ne 0$, $i = 1,2$. Then each $q_i$ will have $rn_i(n_i-1)$ distinct intersection points. As $R(x,p_1,p_2)$ has exactly $rn_1n_2$ distinct zeros the continuity argument implies that $R(x,q_1,q_2)$ will have exactly $rn_1n_2$ distinct zeros. Using the continuity argument again we deduce that all intersection points satisfying (i)-(iii) are pairwise distinct. This shows that $D(x,q_1q_2)$ has $r[n_1(n_1-1) + n_2(n_2-1)]$ simple roots and $rn_1n_2$ double roots. Clearly

$n(n-1) - n_1n_2 \le (n-1)^2$

and the equality holds if and only if either $n_1 = 1$ or $n_1 = n-1$. Hence the maximal number of distinct solutions of $D(x,p) = 0$ is $r(n-1)^2$.

Assume that $p(\lambda,x)$ is degenerate. Then all the intersection points of $p(\lambda,x)$ are the intersection points of

(5.13)  $q(\lambda,x) = \prod_{i=1}^{k} q_i(\lambda,x)$,

where (4.2) is the decomposition of $p(\lambda,x)$ to its irreducible factors. As $\deg q(\lambda,x) < n$ the number of intersection points of $q$ is at most $r(n-2)^2$. Thus any reducible $p(\lambda,x)$ has at most $r(n-1)^2$ intersection points. ■

Corollary 5.14. Let $p(\lambda,x)$ be of the form (5.1). If $\delta(p) \ne 0$ then $p(\lambda,x)$ is irreducible.
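The counting in Theorem 5.11 is elementary arithmetic and can be checked directly. The sketch below, with a hypothetical helper name, tabulates the bound $r[n(n-1) - n_1n_2]$ over all splittings $n = n_1 + n_2$ and confirms that its maximum is $r(n-1)^2$, attained only for $n_1 = 1$ or $n_1 = n-1$, and that it always stays below the generic count $rn(n-1)$ of Theorem 5.10.

```python
def reducible_bound(n, n1, r):
    """Upper bound r[n(n-1) - n1*n2] on distinct roots of D(x), with n2 = n - n1."""
    return r * (n * (n - 1) - n1 * (n - n1))

for n in range(2, 9):
    for r in range(1, 4):
        bounds = {n1: reducible_bound(n, n1, r) for n1 in range(1, n)}
        best = max(bounds.values())
        argmax = sorted(n1 for n1, v in bounds.items() if v == best)
        assert best == r * (n - 1) ** 2
        assert argmax == sorted({1, n - 1})
        assert best < r * n * (n - 1)  # strictly fewer than the irreducible case

print(reducible_bound(4, 1, 1), reducible_bound(4, 2, 1))
```

This gap between $r(n-1)^2$ and $rn(n-1)$ is what lets Corollary 5.14 conclude irreducibility from $\delta(p) \ne 0$.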


We now study the set of reducible and degenerate polynomials $p(\lambda,x)$ of the form (5.1) in the coefficient space $P^{n,r}$. Denote by $M_{rd}$ and $M_{dg}$ the subsets of $P^{n,r}$ corresponding to reducible and degenerated polynomials respectively. Clearly $M_{dg} \subset M_{rd}$.

Definition 5.15. Let $M(n_1,n_2)$ be a subset of $P^{n,r}$ which corresponds to reducible polynomials $p(\lambda,x)$ of the form (5.12). Let $N(n_1,n_2)$ be a subset of $P^{n,r}$ which corresponds to degenerate polynomials of the form

(5.16)  $q(\lambda,x) = q_1(\lambda,x)^2 q_2(\lambda,x)$,  $\deg q_1(\lambda,1) = n_1 \ge 1$,

where $q_i(\lambda,x)$ is monic in $\lambda$ (so $q_2 = 1$ if $\deg q_1(\lambda,1) = n/2$).

Clearly

(5.17)  $M_{rd} = \bigcup_{1 \le n_1 \le n/2} M(n_1,n-n_1)$,  $M_{dg} = \bigcup_{1 \le n_1 \le n/2} N(n_1,n-2n_1)$,

(5.18)  $N(n_1,n-2n_1) \subset M(n_1,n-n_1)$.

Theorem 5.19. The sets $M(n_1,n-n_1)$ and $N(n_1,n-2n_1)$ are irreducible algebraic varieties in $P^{n,r}$ of the dimensions

(5.20)  $\dim M(n_1,n-n_1) = \{[(n+1)r+2]n - 2n_1(n-n_1)r\}/2$,
        $\dim N(n_1,n-2n_1) = \{[(n+1)r+2]n - n_1[r(4n-5n_1+1)+2]\}/2$.

Proof. The equality (5.12) can be represented by a map

(5.21)  $\theta : P^{n_1,r} \times P^{n_2,r} \to P^{n,r}$,  $n_1 + n_2 = n$,

where $q_i$ is a coefficient vector in $P^{n_i,r}$. Clearly $\theta$ is a polynomial map. The image of $\theta$ is exactly $M(n_1,n_2)$. According to Section 3 the closure of $M(n_1,n_2)$ is an algebraic variety in $P^{n,r}$. We now show that $M(n_1,n_2)$ is closed. Assume that we have a sequence of polynomials $p_i(\lambda,x)$ of the form (5.1) such that $p_i(\lambda,x) = q_{1i}(\lambda,x)q_{2i}(\lambda,x)$, $\deg q_{ji}(\lambda,1) = n_j$, $j = 1,2$, and each $q_{ji}(\lambda,x)$ is monic in $\lambda$. Let $\alpha_1^{(i)}(x),\dots,\alpha_{n_1}^{(i)}(x)$ and $\beta_1^{(i)}(x),\dots,\beta_{n_2}^{(i)}(x)$ be the roots of $q_{1i}(\lambda,x) = 0$ and $q_{2i}(\lambda,x) = 0$ respectively. Assume that

$\lim_{i \to \infty} p_i(\lambda,x) = q(\lambda,x)$.

Then it is possible to find a subsequence $i_k$ such that

$\lim_{k \to \infty} \alpha_s^{(i_k)}(x) = \alpha_s(x)$,  $\lim_{k \to \infty} \beta_t^{(i_k)}(x) = \beta_t(x)$,  $1 \le s \le n_1$,  $1 \le t \le n_2$.

Hence (5.12) holds for $q$, where

$q_1(\lambda,x) = \prod_{s=1}^{n_1} (\lambda - \alpha_s(x))$,  $q_2(\lambda,x) = \prod_{t=1}^{n_2} (\lambda - \beta_t(x))$.

So $M(n_1,n_2)$ is closed. The decomposition (5.17) yields that $M_{rd}$ is closed. So if $p \in P^{n,r}$ corresponds to an irreducible polynomial there exists a neighborhood of $p$ corresponding entirely to irreducible polynomials. We now prove the first equality in (5.20). Pick irreducible polynomials $q_j(\lambda,x)$ of degree $n_j$, $j = 1,2$, such that $q_1 \ne q_2$. We claim that $\theta$ is a local homeomorphism in the neighborhood of $(q_1,q_2)$. Indeed suppose that

$p(\lambda,x) = u_1(\lambda,x)u_2(\lambda,x) = v_1(\lambda,x)v_2(\lambda,x)$,

where $(u_1,u_2)$ and $(v_1,v_2)$ are in the neighborhood of $(q_1,q_2)$. So $u_i$ and $v_i$ are irreducible. Since $C[\lambda,x]$ is a unique factorization domain and $u_i$ and $v_i$ are monic we get either $u_1 = v_1$ or $u_1 = v_2$. The assumption that $q_1 \ne q_2$ implies that $u_1 \ne v_2$, so $u_1 = v_1$ and $u_2 = v_2$. Whence $\theta$ is a local homeomorphism in the neighborhood of $(q_1,q_2)$. Thus

$\dim M(n_1,n_2) = \{[(n_1+1)r+2]n_1 + [(n_2+1)r+2]n_2\}/2$,

which establishes the first equality in (5.20). It is left to show that $M(n_1,n_2)$ is an irreducible variety. This follows easily from the fact that $\theta$ is a local homeomorphism in the neighborhood of $(q_1,q_2)$ for $q_1 \ne q_2$.

The assertions about $N(n_1,n-2n_1)$ can be proven in the analogous way. ■

As

$$\{[(n+1)r+2]n - 2n_1(n-n_1)r\}/2 \le \{[(n+1)r+2]n - 2(n-1)r\}/2,$$
$$\{[(n+1)r+2]n - n_1[r(4n-5n_1+1)+2]\}/2 \le \{[(n+1)r+2]n - [4r(n-1)+2]\}/2,$$

from the identities (5.17) we get

Corollary  5.22.  The  sets  and  are  algebraic  varieties  in 

having  the  following  codimensions 

(5.23)  codim  =  (n-1)r,  codim  =  2r(n-1)+1. 

Moreover  is  an  alijebraic  subvariety  of  '''j-ci* 




6.  Irreducible  pencils.


Let $A,B \in M_n$. With the pencil A+xB we associate its characteristic polynomial

(6.1)  $p(\lambda,x,A,B) = |\lambda I-(A+xB)| = \lambda^n + \sum_{i=1}^{n} p_i(x)\lambda^{n-i}$,  $p_i(x) = \sum_{j=0}^{i} p_{ij}(A,B)x^j$,  $i = 1,\dots,n$.


Clearly $p_{ij}(A,B)$ is a polynomial of degree $i-j$ in the entries of A and of degree j in the entries of B. So $p_{ij}(A,B)$ is a polynomial of (total) degree i. Let (D,E) be a pair similar to (A,B). Obviously, the pencil D+xE has the same characteristic polynomial. So each $p_{ij}(A,B)$ is an invariant polynomial under the action $(TAT^{-1},TBT^{-1})$. In fact, for n=2, it is easy to show that the ring generated by $p_{ij}(A,B)$, $j = 0,\dots,i$, $i = 1,2$, is equal to the ring generated by the five polynomials given by (2.1). We now give the explicit expression for $p_{ij}(A,B)$. We use the notation of Section 1. Then the coefficients of the characteristic polynomial of A+xB are given by
p  (x)  =  I  (A+xB)(a|a), 

n 

i,n 

^  e(B,,e_,Y,,Y,) 

Pjjls.ii)  ■  I  (-')  IaiBjITjIIIbiBjITjH, 


(6.21  8,,»,  6  e  Uj,„.  s,  Bj  .  r,  .  ♦, 


B  B  =  Y  Y  =  <x 
1  2  'l  *2 


P  (A,B)  =  I  lA[a|aj(,  p  (A,B)  =  ^ 

„  oeQ 

.  1  ,n  1 


|B[alal 1 , 


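Here the function $\varepsilon(\beta_1,\beta_2,\gamma_1,\gamma_2)$ is completely determined by $\beta_1,\beta_2,\gamma_1,\gamma_2$, which satisfy the requirements (6.2). Also for $\alpha \in Q_{k,n}$, $\beta \in Q_{m,n}$ we denote by $\alpha \cap \beta$ the common subsequence of $\alpha$ and $\beta$. If $\alpha \cap \beta = \emptyset$ then $\alpha \cup \beta$ denotes the strictly increasing sequence generated by the elements of $\alpha$ and $\beta$.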
The fact that with each pair (A,B) we associate a polynomial of the form (6.1) can be put formally in terms of a polynomial map

(6.3)  $p : M_n \times M_n \to \mathbb{C}^{(n+3)n/2}$,  $p(A,B) = (p_{10}(A,B),\dots,p_{nn}(A,B))$.
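As a small illustration of the map p, the following sketch computes the five coefficients $p_{ij}(A,B)$ for n = 2 and checks that they do not change under a simultaneous similarity. It assumes the sign convention $|\lambda I-(A+xB)| = \lambda^2 + p_1(x)\lambda + p_2(x)$; the helper names and the sample matrices are illustrative choices and are not taken from the report.

```python
from fractions import Fraction

def mat(rows):
    """Build a 2x2 matrix with exact rational entries."""
    return [[Fraction(v) for v in row] for row in rows]

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def inverse(T):
    det = T[0][0] * T[1][1] - T[0][1] * T[1][0]
    return [[T[1][1] / det, -T[0][1] / det],
            [-T[1][0] / det, T[0][0] / det]]

def conjugate(T, X):
    """Return T X T^{-1}."""
    return matmul(matmul(T, X), inverse(T))

def pencil_coefficients(A, B):
    """Coefficients p_ij of det(lam*I - (A + x*B)) for 2x2 matrices.

    Assumed convention: lam^2 + p_1(x)*lam + p_2(x) with
    p_1(x) = p_10 + p_11*x and p_2(x) = p_20 + p_21*x + p_22*x^2.
    """
    det_a = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    det_b = B[0][0] * B[1][1] - B[0][1] * B[1][0]
    # Coefficient of x in det(A + x*B).
    mixed = (A[0][0] * B[1][1] + B[0][0] * A[1][1]
             - A[0][1] * B[1][0] - B[0][1] * A[1][0])
    return {
        (1, 0): -(A[0][0] + A[1][1]),
        (1, 1): -(B[0][0] + B[1][1]),
        (2, 0): det_a,
        (2, 1): mixed,
        (2, 2): det_b,
    }

# Hypothetical sample data.
A = mat([[1, 2], [3, 4]])
B = mat([[0, 1], [5, -2]])
T = mat([[1, 2], [3, 5]])  # invertible, det = -1

original = pencil_coefficients(A, B)
transformed = pencil_coefficients(conjugate(T, A), conjugate(T, B))
print(original == transformed)
```

The dictionary has (n+3)n/2 = 5 entries for n = 2, matching the dimension of the target space of p.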

We shall show in the sequel that p is a regular map (note that $M_n \equiv \mathbb{C}^{n^2}$).

First we need

Definition 6.4. The pencil A+xB (the pair (A,B)) is called reducible (degenerate) if the characteristic polynomial (6.1) is reducible (degenerate). Otherwise the pencil (the pair) is called irreducible (nondegenerate). The pencil A+xB (the pair (A,B)) is called symmetric (real) if A and B are symmetric (real).

We now show that there exist irreducible pencils. More precisely we have

Theorem 6.5. Let $\delta(p)$ be a polynomial defined on the coefficient space by (5.9) (r=1). Then $\delta(p(A,B))$ is a non-trivial polynomial on $M_n \times M_n$. More precisely, there exists a real symmetric pair (A,B) such that

$\delta(p(A,B)) \ne 0$.

Proof. Our proof is very close to the proof of Theorem 5.10. So we point out only the additional arguments we have to use. Choose A and B to be real diagonal, $A = \mathrm{diag}\{a_1,\dots,a_n\}$, $B = \mathrm{diag}\{b_1,\dots,b_n\}$. Then the characteristic polynomial of A+xB is of the form (4.15) with r=1. We choose $a_i$ and $b_i$ such that $p(\lambda,x)$ will have exactly n(n-1)/2 distinct intersection points. That is, D(x) given by (4.17) (r=1) has exactly n(n-1)/2 double
roots.  Suppose  that  6(p(U,E))  =  0.  Then  D(x,D,E)  has  to  have  a  fixed 
double  root  let  us  say  (D,E)  =  such  that 

^,(A,B)  =  =  (a^-a^)/(b2-b^  ). 
Choose E = B and let D be a block diagonal matrix

U  =  diagCo  , • , } 

1  n- 1 

where  is  1^1  matrix  i=1,***/n-2  and 


/■a. 


^  = 


e  ^ 


2 


As  in  the  proof  of  Theorem  b.10  we  deduce  that  (D,E)  *  ^^(U,E)  for 

e  #  0.  This  shows  that  6(p(D,E))  cannot  vanish  identically  on  all  real 

symmetric  pairs.  ■ 

According to Corollary 5.14 if $\delta(p(A,B)) \ne 0$ then the characteristic polynomial of A+xB is irreducible. So "most" of the pencils A+xB are irreducible. Consider a pair (A,B). Assume that A has n distinct eigenvalues. Then A is similar to a diagonal matrix

(6.6)  $D = \mathrm{diag}\{d_1,\dots,d_n\}$.

So the orbit of (A,B) contains a pair of the form (D,E). Clearly E is unique up to a diagonal similarity $XEX^{-1}$ (X is diagonal). Let $E = (e_{ij})_1^n$. Denote


(6.7) 

Definition  6.8.  The  matrix  E  is  called  irreducible  if  the  nonnegative 
matrix  E^  is  irreducible.  That  is,  all  the  entries  of  (I+E^)'’“^  are 
strictly  positive.  (In  other  terms  the  graph  defined  by  E^  is  connected). 

Clearly  the  notion  of  irreducibility  remains  invariant  under  the  diagonal 
similarity.  In  what  follows  we  need  a  criterion  for  the  diagonal  similarity 
(see  Engel-Schneider  [1973]). 


Theorem 6.9. Let $E,F \in M_n$. Assume E is irreducible. Then E and F are diagonally similar if and only if

(6.10)  $e_{ii} = f_{ii}$, $i = 1,\dots,n$,  $e_{i_1i_2}e_{i_2i_3}\cdots e_{i_{k-1}i_k}e_{i_ki_1} = f_{i_1i_2}f_{i_2i_3}\cdots f_{i_{k-1}i_k}f_{i_ki_1}$,  $1 \le i_j \le n$, $j = 1,\dots,k$, $2 \le k \le n$.
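The easy direction of this criterion (diagonally similar matrices have equal diagonal entries and equal cycle products) can be checked numerically. The sketch below is only an illustration under stated assumptions: it enumerates cycles of distinct indices, and the function names and sample matrix are hypothetical.

```python
from fractions import Fraction
from itertools import permutations

def cycle_product(E, cycle):
    """e_{i1 i2} e_{i2 i3} ... e_{ik i1} for cycle = (i1, ..., ik)."""
    k = len(cycle)
    result = Fraction(1)
    for s in range(k):
        result *= E[cycle[s]][cycle[(s + 1) % k]]
    return result

def cycle_products(E):
    """Diagonal entries (k = 1) and cycle products over distinct indices, k <= n."""
    n = len(E)
    return {cyc: cycle_product(E, cyc)
            for k in range(1, n + 1)
            for cyc in permutations(range(n), k)}

def diagonal_similarity(x, E):
    """Return X E X^{-1} for X = diag(x)."""
    n = len(E)
    return [[x[i] * E[i][j] / x[j] for j in range(n)] for i in range(n)]

# Hypothetical sample data: an irreducible 3x3 matrix and a diagonal X.
E = [[Fraction(v) for v in row] for row in [[1, 2, 3], [4, 5, 6], [7, 8, 9]]]
x = [Fraction(2), Fraction(3), Fraction(5)]
F = diagonal_similarity(x, E)
print(cycle_products(E) == cycle_products(F))
```

The scaling factors $x_i/x_j$ cancel around every closed cycle, which is why the products agree although F differs from E entrywise.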


On the other hand if E is reducible the equalities (6.10) do not imply the diagonal similarity of E and F. We now give a set of invariant polynomials in $[M_n \times M_n]^G$ whose values determine uniquely the orbit of (A,B) in case that (A,B) is an irreducible pair.

Theorem  6.11.  Let  A,B  e  M^.  The  polynomials 
il  3,  i  j 

(6.12)  tr(A  B  .•••,A  B  ),  m  =  n(n-1),  *^~^'***'® 

are  invariant  polynomials  under  the  simultaneous  similarity.  Moreover  two 
irreducible  pairs  (A,B)  and  (D,E)  are  simultaneously  similar  if  and  only 
if  the  above  polynomials  have  the  same  values  on  these  pairs. 

Proof. The fact that the polynomials (6.12) are invariant with respect to the action of $GL_n$ is obvious. Suppose next that A has n distinct eigenvalues. Then the orbit of (A,B) contains a pair (D,E) where D is a diagonal matrix with pairwise distinct diagonal entries. The entries of D are known since we are given $\mathrm{tr}(A^i)$, $i = 1,\dots,n$. We claim that E is irreducible. Otherwise (e.g. Gantmacher [1959]) there exists a permutation matrix P such that

(6.13)  $PEP^{-1} = \begin{pmatrix} F_{11} & F_{12} \\ 0 & F_{22} \end{pmatrix}$.

Also $PDP^{-1}$ is a diagonal matrix of the form $\mathrm{diag}\{D_1,D_2\}$. But then

$$|\lambda I-(A+xB)| = |\lambda I_1-(D_1+xF_{11})|\,|\lambda I_2-(D_2+xF_{22})|$$

which contradicts the assumption that A+xB is an irreducible pencil.

According to Theorem 6.9 E is determined up to a diagonal similarity provided we can compute the left-hand side of the equality (6.10). Let

$D_i = \mathrm{diag}\{\delta_{i1},\dots,\delta_{in}\}$.

Then a straightforward calculation shows that

(6.14)  $e_{ii} = \mathrm{tr}(D_iE)$,  $e_{i_1i_2}e_{i_2i_3}\cdots e_{i_{k-1}i_k}e_{i_ki_1} = \mathrm{tr}(D_{i_1}ED_{i_2}E\cdots D_{i_k}E)$.

Since all the eigenvalues of D are pairwise distinct we have

$D_i = p_i(D)$,  $p_i(\lambda) = \prod_{j \ne i} \dfrac{\lambda-\lambda_j}{\lambda_i-\lambda_j}$.
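The trace identity for cycle products and the interpolation formula for $D_i$ can be checked on a small example. The sketch below is a minimal illustration in exact arithmetic; the function names and the sample values of D and E are assumptions made for the example only.

```python
from fractions import Fraction

def identity(n):
    return [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(X):
    return sum(X[i][i] for i in range(len(X)))

def lagrange_projector(d, i):
    """p_i(D) for D = diag(d): the product over j != i of (D - d_j I)/(d_i - d_j)."""
    n = len(d)
    P = identity(n)
    for j in range(n):
        if j != i:
            factor = [[(d[r] - d[j]) / (d[i] - d[j]) if r == c else Fraction(0)
                       for c in range(n)] for r in range(n)]
            P = matmul(P, factor)
    return P

def cycle_trace(d, E, cycle):
    """tr(D_{i1} E D_{i2} E ... D_{ik} E) with D_i = p_i(D)."""
    M = identity(len(d))
    for i in cycle:
        M = matmul(matmul(M, lagrange_projector(d, i)), E)
    return trace(M)

# Hypothetical sample data: pairwise distinct diagonal entries and a full matrix E.
d = [Fraction(1), Fraction(2), Fraction(4)]
E = [[Fraction(v) for v in row] for row in [[1, 2, 3], [4, 5, 6], [7, 8, 10]]]

print(cycle_trace(d, E, (0, 1, 2)) == E[0][1] * E[1][2] * E[2][0])
print(cycle_trace(d, E, (1,)) == E[1][1])
```

Because each $p_i(D)$ is a polynomial in D, every such trace is a combination of traces of words in D and E, which is the step the proof uses next.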


So the right hand side of (6.14) contains expressions of the form

(6.15)  $\mathrm{tr}(D^{\ell_1}ED^{\ell_2}E\cdots D^{\ell_k}E)$,  $0 \le \ell_i \le n-1$, $i = 1,\dots,k$, $1 \le k \le n-1$.

Substituting A for D and B for E we realize that the above expressions are included in the expressions appearing in (6.12). This proves the theorem in case that A has n distinct eigenvalues. In the general case let $A_0 = A+x_0B$. Clearly we can choose $x_0$ such that $A_0$ has n distinct eigenvalues. Then in the expression (6.15) we have to substitute $A+x_0B$ for D and B for E. A straightforward calculation also shows that all possible expressions in terms of A and B which may appear in (6.15) are listed in (6.12). The proof of the theorem is completed. ■

In  fact  we  proved  a  more  precise  statement. 

Theorem 6.16. Let A be a diagonal matrix with pairwise distinct diagonal entries and let B be irreducible. Then the orbit of (A,B) is determined uniquely by the values (6.12).

The distinction between Theorems 6.11 and 6.16 becomes apparent for $n \ge 3$. Indeed, if A is a diagonal matrix with pairwise distinct diagonal entries and A+xB is irreducible then B is an irreducible matrix. However, for $n \ge 3$ there exist A and B satisfying the assumptions of Theorem 6.16 such that A+xB is a reducible pencil (e.g. Friedland-Simon [1981]). In fact, the assumption that any pencil satisfying the assumptions of Theorem 6.16 is irreducible implies a false conjecture in (Avron-Simon [1974], Friedland-Simon [1981]).

Thus  we  proved 

Theorem 6.17. The values of the invariant functions (6.12) separate all orbits of pairs (A,B) such that A and B do not have a common invariant subspace and A+xB is not a degenerate pencil.

Let D be a diagonal matrix with pairwise distinct diagonal entries. Consider a matrix $C = (c_{ij})_1^n$ such that $c_{i1} \ne 0$, $i = 2,\dots,n$. Then we can find a diagonal matrix X such that $E = XCX^{-1} = (e_{ij})_1^n$, $e_{i1} = 1$, $i = 2,\dots,n$. In fact X is unique up to a multiplication by a scalar $x \ne 0$. That is, in "general" we can find a unique pair

(6.18)  (D,E),  $D = \mathrm{diag}\{d_1,\dots,d_n\}$, $d_i \ne d_j$ for $i \ne j$,  $E = (e_{ij})_1^n$, $e_{i1} = 1$, $i = 2,\dots,n$,

in the orbit of (A,B). That is, if $\mathrm{orb}(A_i,B_i)$ contains a pair $(D_i,E_i)$ of the form (6.18), i = 1,2, then $\mathrm{orb}(A_1,B_1) = \mathrm{orb}(A_2,B_2)$ if and only if $(D_1,E_1) = (D_2,E_2)$. That shows that the "generic" orbits are parametrized by $n^2+1$ parameters. If the entries of the matrix E given in (6.18) are all distinct from zero then orb(D,E) is fixed by the values of the polynomials (6.12) in view of Theorem 6.11. So the ring of invariant polynomials $[M_n \times M_n]^G$ separates the "generic" orbits. That is

Theorem 6.19. The transcendence degree of the ring of invariant polynomials $[M_n \times M_n]^G$ is $n^2+1$.

We were not able to find a simple transcendence basis in $[M_n \times M_n]^G$ for a general n.

We now show that the functions (6.12) generate the field of invariants $(M_n \times M_n)^G$.

Theorem 6.20. The invariant polynomials given in (6.12) generate the field of rational invariant functions $(M_n \times M_n)^G$.
Proof.  Let  {k=2^^^“^^)  be  the  invariant  polynomials  given  in 

(6.12).  In  Theorem  6.11  we  showed  that  the  values  of  these  polynomials 
determine  the  orbit  of  an  irreducible  pair  (A,B).  Since  the  set  of 
irreducible  pairs  (A,B)  form  an  open  (algebraic)  set  in  x  it  follows 

Q 

that  form  a  transcendal  basis  in  (M  x  m  )  .  That  is  any  0  6 

1  )c  n  n 

Q 

(M  X  M  )  satisfied  the  equation 
n  n 

m 

(6.21)  e"  +  =  0. 

i=0 

Let  X  be  the  polynomial  map  given  by  =  ('^^ .  •**# '^j^)* 

Denote  by  V  the  closure  of  '^(M  x  m  ),  So  0  is  an  algebraic  function 

n  n 

on  V.  On  the  other  hand  let  V®  be  the  set  of  all  i^(A,B)  such  that 
(A,B)  is  an  irreducible  pair.  So  V®  is  an  open  set  in  V.  Since  for 
any  6  V^,  '^(A,B)  determines  the  orb(A.B)  we  see  that  0(v’(A,B)) 

is  defined  uniquely.  So  the  minimal  m  in  (6.22)  can  k)e  chosen  to  be  1. 

That  is  0  is  rational  in  '  ***''^)j*  Hence  generates  the  field 

(Mn  X  ■ 

In conclusion we note that according to Procesi [1976] the invariant polynomials (6.12) generate the ring of invariant polynomials $[M_n \times M_n]^G$.

7.  Symmetric  pairs

In this section we are going to consider (complex) symmetric pairs of matrices (A,B). Two pairs of symmetric matrices (A,B) and (D,E) are called orthogonally similar if there exists a (complex) orthogonal matrix O such that

(7.1)  $D = O^tAO$,  $E = O^tBO$.


Denote by $S_n$ ($S_n(\mathbb{R})$) the set of $n \times n$ complex (real) valued symmetric matrices. So the complex (real) orthogonal group $O_n$ ($O_n(\mathbb{R})$) acts in the above form on $S_n \times S_n$ ($S_n(\mathbb{R}) \times S_n(\mathbb{R})$). Suppose that A is symmetric and has n distinct eigenvalues. Then there exists an orthogonal matrix O such that $D = O^tAO$ is a diagonal matrix. Thus the orbit of (A,B) contains a symmetric pair (D,E). We claim that E is fixed up to an action of the following finite orthogonal group,

(7.2)  $DO_n = \{D \mid D = \mathrm{diag}\{1,d_2,\dots,d_n\},\ d_i^2 = 1,\ i = 2,\dots,n\}$.

Lemma 7.3. Let $E,F \in S_n$. Assume that E and F are diagonally similar. Then there exists $D \in DO_n$ such that F = DED.

Proof. Suppose that $F = DED^{-1}$, $D = \mathrm{diag}\{d_1,\dots,d_n\}$. Assume that $e_{ij} \ne 0$, then

$$f_{ij} = d_ie_{ij}d_j^{-1} = f_{ji} = d_je_{ji}d_i^{-1}.$$

That is $d_i^2 = d_j^2$. Suppose that E is irreducible. We then deduce $d_1^2 = \cdots = d_n^2$. Clearly we can choose $d_1 = 1$. This shows that $D \in DO_n$. Suppose that E is reducible. Since E is symmetric there exists a permutation matrix P such that

$$P^tEP = \mathrm{diag}\{E_1,\dots,E_k\}$$

where each $E_i$ is irreducible. But then

$$P^tFP = \mathrm{diag}\{F_1,\dots,F_k\},\quad P^tFP = P^tDP\,(P^tEP)\,P^tD^{-1}P.$$

So each $F_i$ is diagonally similar to $E_i$. Now use the above argument to show that D can be chosen to be in $DO_n$. ■

Thus if $A \in S_n$ has n distinct eigenvalues a pair (D,E) essentially parametrizes (up to the action of $DO_n$) the orbit of (A,B). Hence "most of the orbits" are parametrized by n + n(n+1)/2 parameters. That is, the transcendence degree of $(S_n \times S_n)^{O_n}$ is (n+3)n/2. It is easy to find a transcendence basis in $[S_n \times S_n]^{O_n}$.

Theorem 7.4. The polynomials

(7.5)  $\mathrm{tr}(A^i)$, $i = 1,\dots,n$,  $\mathrm{tr}(A^iBA^jB)$, $0 \le i \le j \le n-1$,

form a transcendence basis in $[S_n \times S_n]^{O_n}$. Moreover, if A has n distinct eigenvalues then the values of these polynomials determine at most $2^{(n^2-n+2)/2}$ distinct orbits.

Proof. First note that $\mathrm{tr}(A^iBA^jB) = \mathrm{tr}(BA^jBA^i)$ for any pair (i,j), since A and B are symmetric and a matrix and its transpose have the same trace. As $\mathrm{tr}(A^i)$, $i = 1,\dots,n$, are given we know the characteristic polynomial of A. Using the values of the polynomials in (7.5) and the remark above we can compute any polynomial of the form $\mathrm{tr}(A^iBA^jB)$. Assume next that A has n distinct eigenvalues. So the orbit of (A,B) contains a symmetric pair (D,E) where D is a diagonal matrix with pairwise distinct diagonal entries. The entries of D are known since we are given $\mathrm{tr}(A^i)$, $i = 1,\dots,n$. Using the arguments of the proof of Theorem 6.11 we deduce that we can compute the products $e_{ij}e_{ji}$, $1 \le i,j \le n$, in terms of the given polynomials (7.5). As E is symmetric it follows that we know the values of $e_{ij}^2$. So each $e_{ij}$ is fixed up to a factor ±1. There are at most $2^{(n+1)n/2}$ different matrices E. However (D,E) and (D,XEX) are in the same orbit for any $X \in DO_n$. Thus we have at most $2^{(n+1)n/2-(n-1)}$ distinct orbits corresponding to the values of the polynomials in (7.5). In fact, if all $e_{ij}$ are different from zero then the knowledge of D and all $e_{ij}^2$ gives rise to exactly $2^{(n^2-n+2)/2}$ distinct orbits, provided that D has pairwise distinct entries. Since the orbit space of symmetric pairs (A,B) is parametrized by (n+3)n/2 parameters we see that the polynomials (7.5) form a transcendence basis in $[S_n \times S_n]^{O_n}$. ■
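The orbit count in this proof can be confirmed by brute force for small n. The sketch below assumes all entries $e_{ij}$, $i \le j$, are nonzero with known squares, so that E is described by a sign pattern, and lets diag(1, ±1, ..., ±1) act by $e_{ij} \mapsto d_id_je_{ij}$. The function name is a hypothetical choice for this illustration.

```python
from itertools import product

def count_sign_orbits(n):
    """Count orbits of sign patterns of (e_ij), i <= j, under E -> XEX,
    where X = diag(1, d_2, ..., d_n) and each d_i is +1 or -1."""
    positions = [(i, j) for i in range(n) for j in range(i, n)]
    group = [(1,) + rest for rest in product((1, -1), repeat=n - 1)]
    seen = set()
    orbits = 0
    for signs in product((1, -1), repeat=len(positions)):
        if signs in seen:
            continue
        orbits += 1
        pattern = dict(zip(positions, signs))
        # Mark every sign pattern in the same orbit as visited.
        for d in group:
            seen.add(tuple(d[i] * d[j] * pattern[(i, j)] for (i, j) in positions))
    return orbits

for n in range(1, 5):
    print(n, count_sign_orbits(n), 2 ** ((n * n - n + 2) // 2))
```

The group has $2^{n-1}$ elements and acts freely on the $2^{n(n+1)/2}$ sign patterns, which gives the exponent $(n+1)n/2-(n-1) = (n^2-n+2)/2$.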

Combine the arguments of the proofs of Theorems 7.4 and 6.17 to get

Theorem 7.6. Let (A,B) and (D,E) be non-degenerate symmetric pairs. Assume that neither A and B nor D and E have a common invariant subspace. Then these pairs are orthogonally similar if and only if the polynomials given by (6.12) have the same values on these pairs. ■

We now classify the orbits of $2 \times 2$ symmetric pairs under the orthogonal similarity. Let U and V be subvarieties of $S_2 \times S_2$ given by (2.3) and (2.8) respectively. Suppose that (A,B) lies in U but $(A,B) \notin V$. Then the arguments of Theorem 2.9 show that (A,B) is orthogonally similar to a pair of diagonal matrices (D,F). Whence it follows

Theorem 7.7. Let $(A,B) \in S_2 \times S_2$. Assume that $(A,B) \notin V$ (i.e. (A,B) is non-degenerate). Then the values of the five polynomials given by (2.1) determine a unique orbit under the action of $O_2$.

Suppose that $C \in S_2$ and C has one multiple eigenvalue. It is easy to show that either C is of the form

(7.8)  $C(\lambda,a) = \begin{pmatrix} \lambda+a & ai \\ ai & \lambda-a \end{pmatrix}$

or $C = C(\lambda,-a)$. Moreover if $a \ne 0$ then $C(\lambda,a)$ is orthogonally similar to any $C(\lambda,b)$ or $C(\lambda,-b)$, $b \ne 0$. Thus if $(A,B) \in V$ we have that $A = C(\lambda,\pm a)$, $B = C(\mu,\pm b)$.

Therefore the function $\gamma(A,B)$ given by (2.24) is equal to $\pm a/b$ and is a nontrivial function in $(V)^{O_2}$. So Theorem 2.23 applies to $S_2 \times S_2$.

Theorem 7.9. Let $(A,B) \in V \cap (S_2 \times S_2)$. Then $\gamma(A,B)$ belongs to $(V)^{O_2}$. If either $\gamma(A,B)$ or $1/\gamma(A,B)$ is defined on the orbit of (A,B) then the values of tr(A), tr(B) and $\gamma(A,B)$ ($1/\gamma(A,B)$) determine a unique orbit in V under the action of $O_2$. Otherwise the orbit of (A,B) consists of one point $\left(\frac{\mathrm{tr}(A)}{2}I, \frac{\mathrm{tr}(B)}{2}I\right)$.
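As a quick check of the normal form (7.8), the sketch below verifies on one sample that a symmetric matrix with rows $(\lambda+a,\ ai)$ and $(ai,\ \lambda-a)$ has the double eigenvalue $\lambda$ and that its difference from $\lambda I$ squares to zero. That reading of (7.8), the function names and the sample values are assumptions of this illustration.

```python
def C(lam, a):
    """Symmetric 2x2 matrix [[lam + a, a*i], [a*i, lam - a]] (assumed reading of (7.8))."""
    return [[lam + a, a * 1j], [a * 1j, lam - a]]

def trace_and_det(M):
    return M[0][0] + M[1][1], M[0][0] * M[1][1] - M[0][1] * M[1][0]

def nilpotent_part_squared(lam, a):
    """Return (C(lam, a) - lam*I)^2, expected to be the zero matrix."""
    M = C(lam, a)
    N = [[M[i][j] - (lam if i == j else 0) for j in range(2)] for i in range(2)]
    return [[sum(N[i][k] * N[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

# Hypothetical sample values.
lam, a = 2, 3
tr, det = trace_and_det(C(lam, a))
# Characteristic polynomial t^2 - tr*t + det equals (t - lam)^2 when both hold.
print(tr == 2 * lam, det == lam ** 2)
print(nilpotent_part_squared(lam, a))
```

For $a \ne 0$ the matrix is therefore symmetric but not diagonalizable, which is the degenerate behaviour that the set V collects.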

The disadvantage of the transcendence basis (7.5) of $[S_n \times S_n]^{O_n}$ is that these polynomials are not symmetric with respect to A and B. The natural candidates for a symmetric basis in $[S_n \times S_n]^{O_n}$ are the coefficients of the characteristic polynomial of the pencil A+xB. Indeed, the map (6.3) restricted to $S_n \times S_n$ yields

(7.10)  $p : S_n \times S_n \to \mathbb{C}^{(n+3)n/2}$.

Thus, if $p_{10}(A,B),\dots,p_{nn}(A,B)$ are algebraically independent it follows that these polynomials form a transcendence basis in $[S_n \times S_n]^{O_n}$ since the transcendence degree of $(S_n \times S_n)^{O_n}$ is (n+3)n/2. Also the polynomials $p_{ij}(A,B)$ do exhibit symmetry in A and B since

(7.11)  $p_{ij}(A,B) = p_{i,i-j}(B,A)$,  $0 \le j \le i$, $0 < i \le n$.

Clearly, $p_{ij}(A,B)$, $0 \le j \le i$, $0 < i \le n$, form a transcendence basis in $[S_n \times S_n]^{O_n}$ if and only if the map (7.10) is regular. If we can show that p is an onto map then of course p is regular. Suppose that we showed that p is an onto map for $k \le n-1$. Thus if $p(\lambda,x)$ is a reducible polynomial of the form (5.1) then there exists a pair of symmetric block diagonal matrices (A,B),

$A = \mathrm{diag}\{A_1,A_2\}$,  $B = \mathrm{diag}\{B_1,B_2\}$

such that

$|\lambda I-(A+xB)| = p(\lambda,x)$.

So, in order to prove that p is an onto map, it is enough to show that any irreducible polynomial $p(\lambda,x)$ of the form (5.1) is a characteristic polynomial of some symmetric pencil A+xB. By making a transformation $x = y+x_0$ it is enough to consider irreducible polynomials with the property

(7.12)  $p(\lambda,0) = \prod_{i=1}^{n}(\lambda-d_i)$,  $d_i \ne d_j$ for $i \ne j$.

If $|\lambda I-(A+xB)| = p(\lambda,x)$ then we find a symmetric pair (D,E), $D = \mathrm{diag}\{d_1,\dots,d_n\}$, such that $|\lambda I-(D+xE)| = p(\lambda,x)$. Thus, if p is an onto map then the map

(7.13)  $p(D,\cdot) : S_n \to \mathbb{C}^{n(n+1)/2}$,  $p(D,E) = (p_{11}(D,E),\dots,p_{ij}(D,E),\dots,p_{nn}(D,E))$

is an onto map. Clearly, $p_{ij}(D,E)$ is a homogeneous polynomial of degree j in the entries of E. According to Theorem 3.20 if the system

(7.14)  $p_{ij}(D,E) = 0$,  $j = 1,\dots,i$, $i = 1,\dots,n$

has a unique solution E = 0 then the map (7.13) is an onto map. The system (7.14) is equivalent to the assertion that the pencil D+xE has constant eigenvalues (spectrum) $d_1,\dots,d_n$. The main result of the next section is

Theorem 7.15. Let D be a diagonal matrix with pairwise distinct diagonal entries. Assume that the symmetric pencil D+xE has a constant spectrum. Then E = 0 for $1 \le n \le 4$. For $n \ge 5$ and a given D there exist nontrivial E satisfying (7.14).

8.  Polynomial  matrices  with  a  constant  spectrum


In what follows we adopt the following notation. By $M_n(\mathbb{C}[x])$ we denote $n \times n$ matrices with polynomial entries. That is, if $A(x) \in M_n(\mathbb{C}[x])$ then

(8.1)  $A(x) = (a_{ij}(x))_1^n = \sum_{j=0}^{m} A_jx^j$,  $A_j \in M_n$.

Let $GL_n(\mathbb{C}[x])$ be the general linear group over the ring $\mathbb{C}[x]$. That is, $U(x) \in M_n(\mathbb{C}[x])$ is in $GL_n(\mathbb{C}[x])$ if and only if there exists $V(x) \in M_n(\mathbb{C}[x])$ such that U(x)V(x) = I. Denote by $O_n(\mathbb{C}[x])$ the orthogonal subgroup of $GL_n(\mathbb{C}[x])$. That is, $O_n(\mathbb{C}[x])$ consists of all U(x) such that $U(x)U^t(x) = I$. Denote by $S_n(\mathbb{C}[x])$ and $A_n(\mathbb{C}[x])$ the subsets of symmetric and skew symmetric matrices in $M_n(\mathbb{C}[x])$ respectively. Here $A_n$ denotes the set of skew symmetric matrices ($A^t = -A$) in $M_n$. Let $F(x) \in M_n(\mathbb{C}[x])$ and consider the differential equation

(8.2)  $\dfrac{dU}{dx} = UF(x)$

with the initial condition U(0) = I. Then U(x) is invertible for each x. In fact it is easy to see that $V = U^{-1}(x)$ satisfies the equation

(8.3)  $\dfrac{dV}{dx} = -F(x)V$

with V(0) = I. Clearly, U(x) is orthogonal for each x if and only if F(x) is a skew symmetric matrix. However, usually U(x) will not belong to $M_n(\mathbb{C}[x])$.

Theorem 8.4. Let $U(x) \in GL_n(\mathbb{C}[x])$. Then U(x) satisfies the differential equation (8.2) with $F(x) \in M_n(\mathbb{C}[x])$,

(8.5)  $F(x) = \sum_{j=0}^{\ell} F_jx^j$.

Moreover $F_\ell$ must be nilpotent.

Proof. Define

$F(x) = U^{-1}\dfrac{dU}{dx}$.

So $F(x) \in M_n(\mathbb{C}[x])$ and U satisfies (8.2). Suppose that F is of the form (8.5). Then around $x = \infty$ the leading part of (8.2) reduces to

$\dfrac{dU}{dx} = UF_\ell x^{\ell}$.

Thus around $x = \infty$ the solution U(x) behaves as $U(\zeta)\exp(F_\ell x^{\ell+1}/(\ell+1))$ ($|\zeta| \gg 1$). If $F_\ell$ is not nilpotent some entries of U(x) behave at infinity as $e^{\rho x^{\ell+1}}$ ($\rho \ne 0$). Since U(x) is a polynomial in x we deduce that $F_\ell$ is nilpotent. (See for example Lutz [1967] for the precise version of this result). ■

Let E be a nilpotent matrix. Then

(8.6)  $U(x) = e^{p(x)E}$,  $p(x) \in \mathbb{C}[x]$,

belongs to $GL_n(\mathbb{C}[x])$. In fact we have

Theorem 8.7. The group $GL_n(\mathbb{C}[x])$ is generated by $GL_n$ and the matrices of the form (8.6).

Proof. Let $U(x) \in GL_n(\mathbb{C}[x])$. It is well known (e.g. Gantmacher [1959, Ch. 6]) that U(x) can be brought by the elementary operations to its Smith normal form which is the identity matrix I. Each such elementary operation is carried out by multiplication from the left or right by the following types of matrices A(x):

(i)  $A(x) = A \in GL_n$,

(ii)  $A(x) = I + p(x)E_{ij} = \exp(p(x)E_{ij})$,  $i \ne j$,

where $E_{ij}$ is the matrix whose (i,j) entry is 1 and all other entries vanish. So $E_{ij}^2 = 0$. This proves the theorem. ■

Suppose that $E \in A_n$ (the set of skew symmetric matrices) and is nilpotent. Then the matrix U(x) given by (8.6) is orthogonal. Clearly for n=2 the only skew symmetric nilpotent matrix is the zero matrix. Therefore, if $U(x) \in O_2(\mathbb{C}[x])$ the corresponding F(x) in (8.2) must be the zero matrix by virtue of Theorem 8.4. That is

(8.8)  $O_2(\mathbb{C}[x]) = O_2$.

This also follows from the simple fact that the squares of the elements in each row of an orthogonal matrix U(x) sum to 1. Indeed the equality

$$1 = u^2(x) + v^2(x) = (u(x)+iv(x))(u(x)-iv(x))$$

implies that u(x) and v(x) are constants if u(x) and v(x) are polynomials. For $n \ge 3$ there are non-zero skew symmetric nilpotent matrices. For example

(8.9)  $E = \mathrm{diag}(H,0)$,  $H = \begin{pmatrix} 0 & 1+i & 0 \\ -1-i & 0 & -1+i \\ 0 & 1-i & 0 \end{pmatrix}$.
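The properties claimed for H can be verified directly. The sketch below checks that H is skew symmetric with $H^3 = 0$ and that the polynomial matrix $I + xH + x^2H^2/2$, which equals $\exp(xH)$ once $H^3 = 0$, is orthogonal at a few sample points. The helper names and the sample points are assumptions of this illustration, and a small tolerance is used because the arithmetic is floating point.

```python
H = [[0, 1 + 1j, 0],
     [-1 - 1j, 0, -1 + 1j],
     [0, 1 - 1j, 0]]

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(X):
    return [list(col) for col in zip(*X)]

def U(x):
    """exp(x*H) = I + x*H + (x^2/2)*H^2, a polynomial in x because H^3 = 0."""
    H2 = matmul(H, H)
    n = len(H)
    return [[(1 if i == j else 0) + x * H[i][j] + (x * x / 2) * H2[i][j]
             for j in range(n)] for i in range(n)]

def max_abs_difference(X, Y):
    n = len(X)
    return max(abs(X[i][j] - Y[i][j]) for i in range(n) for j in range(n))

identity = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
zero = [[0, 0, 0], [0, 0, 0], [0, 0, 0]]
H3 = matmul(matmul(H, H), H)

print(max_abs_difference(H3, zero) < 1e-12)  # nilpotency check
for x in (2, 1 + 2j, -0.5):
    # U(x) U(x)^t should be the identity for every complex x.
    print(max_abs_difference(matmul(U(x), transpose(U(x))), identity) < 1e-9)
```

Orthogonality here follows from $U(x)^t = \exp(-xH)$, so the product telescopes to the identity for every complex x.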


So we pose an obvious problem

Problem 8.10. Is $O_n(\mathbb{C}[x])$ generated by $O_n$ and the matrices of the form (8.6) ($E^t = -E$) for $n \ge 3$?

Let $A(x) \in M_n(\mathbb{C}[x])$. A(x) is called a rank 1 matrix if $A(x) \not\equiv 0$ and all $2 \times 2$ minors of A vanish identically. Suppose that $A \in M_n$ is a rank one matrix. Then $A = (u_iv_j)_1^n$, where $u = (u_1,\dots,u_n)^t$ and $v = (v_1,\dots,v_n)^t$ span the ranges of A and $A^t$ respectively. For a rank 1 matrix A(x) we have a similar result.

Theorem 8.11. Let $A(x) \in M_n(\mathbb{C}[x])$ be a rank 1 matrix. Then there exist polynomials $a(x)$, $u_1(x),\dots,u_n(x)$, $v_1(x),\dots,v_n(x)$ such that

(8.12)  $A(x) = (a_{ij}(x))_1^n$,  $a_{ij}(x) = a(x)u_i(x)v_j(x)$,  $i,j = 1,\dots,n$,

and $u_i$, $i = 1,\dots,n$ ($v_i$, $i = 1,\dots,n$) do not have common zeros. Moreover if A(x) is symmetric it is possible to choose $u_i(x) = v_i(x)$, $i = 1,\dots,n$.
Proof. Choose $\xi$ and constant vectors $\alpha$ and $\beta$ such that $A(\xi)\alpha \ne 0$, $A^t(\xi)\beta \ne 0$. So $A(x)\alpha = a_1(x)(u_1(x),\dots,u_n(x))^t$, $A^t(x)\beta = a_2(x)(v_1(x),\dots,v_n(x))^t$ where $a_1(x)$, $a_2(x)$, $u_i(x)$ and $v_j(x)$ are polynomials such that $u_1,\dots,u_n$ ($v_1,\dots,v_n$) do not have a common zero. For x fixed in the neighborhood of $\xi$, A(x) is a rank 1 matrix. So $A(x) = (a(x)u_i(x)v_j(x))_1^n$ where a(x) is a rational function. It is left to show that a(x) is a polynomial. Suppose that $a(x) = b(x)/c(x)$ and $b(\eta) \ne 0$, $c(\eta) = 0$. As $A(x) \in M_n(\mathbb{C}[x])$, $u_i(\eta)v_j(\eta) = 0$ for $i,j = 1,\dots,n$. Since $u_i(x)$, $i = 1,\dots,n$, do not have a common zero there exists $1 \le i \le n$ such that $u_i(\eta) \ne 0$. Hence $v_j(\eta) = 0$, $j = 1,\dots,n$, which contradicts our assumption that $v_1(x),\dots,v_n(x)$ do not have a common zero. Hence a(x) is a polynomial. Suppose that A(x) is symmetric. Then we can choose $\beta = \alpha$ so v(x) = u(x). ■

Let $U(x) \in GL_n(\mathbb{C}[x])$ be such that U(0) = I. Define

(8.13)  $A(x) = U(x)A_0U^{-1}(x)$.

Then the eigenvalues of A(x) are constant (do not depend on x). The converse of this statement is true if A(x) has n distinct eigenvalues.

Theorem 8.14. Let $A(x) \in M_n(\mathbb{C}[x])$. Assume that A(x) has constant pairwise distinct eigenvalues. Then (8.13) holds with $U(x) \in GL_n(\mathbb{C}[x])$ and U(0) = I. If in addition A(x) is symmetric then $U(x) \in O_n(\mathbb{C}[x])$.

Proof. By considering the matrices $TA(x)T^{-1}$, $T \in GL_n$, we may assume that $A_0 = D = \mathrm{diag}\{d_1,\dots,d_n\}$. As A(x) has a constant spectrum each $d_i$ is an eigenvalue of A(x). Let

(8.15)  $P_i(x) = \prod_{1 \le j \le n,\ j \ne i} \dfrac{d_jI - A(x)}{d_j-d_i}$.

So $P_i(x)$ is a rank one matrix with $\mathrm{tr}(P_i(x)) = 1$. Theorem 8.11 yields

$P_i(x) = (u_j^i(x)v_k^i(x))_{j,k=1}^n$,  $\sum_{j=1}^{n} u_j^i(x)v_j^i(x) = 1$.

Clearly $u^i(x) = (u_1^i(x),\dots,u_n^i(x))^t$ and $v^i(x) = (v_1^i(x),\dots,v_n^i(x))^t$ are the eigenvectors of A(x) and $A^t(x)$ respectively corresponding to the eigenvalue $d_i$. As $d_i \ne d_j$ for $i \ne j$ we have

$\sum_{k=1}^{n} u_k^i(x)v_k^j(x) = 0$  for $i \ne j$.

Define

$U(x) = (u^1(x),\dots,u^n(x))$,  $V(x) = (v^1(x),\dots,v^n(x))$.

Then $V^t(x)U(x) = I$. So $U(x),V(x) \in GL_n(\mathbb{C}[x])$. As $u^i(x)$ is the eigenvector of A(x) corresponding to $d_i$ we have the equality

$A(x)U(x) = U(x)D$.

Also the assumption that $A_0 = D$ means that we can choose $u^i(x)$ to satisfy $u^i(0) = (\delta_{i1},\dots,\delta_{in})^t$, $i = 1,\dots,n$. This proves the theorem for a general A(x). Suppose that $A(x) \in S_n(\mathbb{C}[x])$. Then we can choose $T \in O_n$. According to Theorem 8.11 V = U. So $U(x) \in O_n(\mathbb{C}[x])$. ■
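The spectral projectors used in this proof can be computed explicitly for a small polynomial matrix with constant spectrum. The sketch below takes the hypothetical example with rows (1, x) and (0, 2), whose eigenvalues are 1 and 2 for every x, and checks that each $P_i(x)$ has trace 1, has vanishing determinant, and satisfies $A(x)P_i(x) = d_iP_i(x)$. The example matrix and the function names are illustrative assumptions.

```python
from fractions import Fraction

def identity(n):
    return [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def A_of(x):
    """Hypothetical example: A(x) = [[1, x], [0, 2]] has spectrum {1, 2} for every x."""
    return [[Fraction(1), Fraction(x)], [Fraction(0), Fraction(2)]]

d = [Fraction(1), Fraction(2)]

def projector(A, d, i):
    """P_i = product over j != i of (d_j I - A)/(d_j - d_i)."""
    n = len(d)
    P = identity(n)
    for j in range(n):
        if j != i:
            factor = [[((d[j] if r == c else 0) - A[r][c]) / (d[j] - d[i])
                       for c in range(n)] for r in range(n)]
            P = matmul(P, factor)
    return P

for x in (0, 3, -7):
    A = A_of(x)
    for i in range(2):
        P = projector(A, d, i)
        trace = P[0][0] + P[1][1]
        minor = P[0][0] * P[1][1] - P[0][1] * P[1][0]
        eigen_ok = matmul(A, P) == [[d[i] * v for v in row] for row in P]
        print(x, i, trace == 1, minor == 0, eigen_ok)
```

Here the columns of $P_1$ and $P_2$ give the eigenvectors $(1,0)^t$ and $(x,1)^t$, so U(x) has rows (1, x) and (0, 1), is polynomial in x, and satisfies U(0) = I.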

Theorem 8.14 does not apply if A(x) has multiple eigenvalues. Indeed let

$A(x) = \begin{pmatrix} x & ix \\ ix & -x \end{pmatrix}$.

Then $d_1 = d_2 = 0$ are the eigenvalues of A(x). Also $A_0 = 0$. So (8.13) does not hold. Theorem 8.14 together with the equality (8.8) yield Theorem 7.15 for n = 2. Of course the cases n = 1,2 can be proved easily in the direct way. However for the cases n = 3,4 (in particular n = 4) we need also the following characterization of $A(x) \in M_n(\mathbb{C}[x])$ with a constant spectrum. As usual let [A,B] denote the commutator AB-BA.

Theorem 8.16. Let F(x) be of the form (8.5). Consider the equation (8.2) with the initial condition U(0) = I. Let A(x) be given by (8.13). Then A(x) is of the form (8.1) if and only if F(x) satisfies the following non-linear equation of order m

(8.17)  $\left[\dfrac{d^mF}{dx^m},A_0\right] + \cdots + [F,[F,\dots[F,A_0]\dots]] = 0$,

where the last term contains m+1 nested commutators. In particular

(8.18)  $[F_\ell,[F_\ell,\dots[F_\ell,A_0]\dots]] = 0$  (m+1 nested commutators).


Proof. The matrix $V(x) = U^{-1}(x)$ satisfies the equation (8.3). Thus if A(x) is given by (8.13) we have

(8.19)
$\dfrac{dA}{dx} = UFA_0V - UA_0FV = U[F,A_0]V$,
$\dfrac{d^2A}{dx^2} = U\left\{\left[\dfrac{dF}{dx},A_0\right] + [F,[F,A_0]]\right\}V$,
$\dfrac{d^{m+1}A}{dx^{m+1}} = U\left\{\left[\dfrac{d^mF}{dx^m},A_0\right] + \cdots + [F,[F,\dots[F,A_0]\dots]]\right\}V$.

Thus $\dfrac{d^{m+1}}{dx^{m+1}}A(x) \equiv 0$ if and only if (8.17) holds. Assume that F(x) is of the form (8.5). Then the coefficient of $x^{\ell(m+1)}$ in the left hand side of (8.17) is equal to $[F_\ell,[F_\ell,\dots[F_\ell,A_0]\dots]] = 0$. This proves (8.18). ■

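Assume that $F_\ell$ satisfies (8.18). By letting $F(x) = F_\ell$ in (8.2) we obtain F(x) which satisfies (8.17). Thus we proved

Theorem 8.20. Let $F_\ell$ satisfy (8.18). Then

(8.21)  $A(x) = e^{F_\ell x}A_0e^{-F_\ell x} = \sum_{k=0}^{m} \dfrac{1}{k!}[F_\ell,[F_\ell,\dots[F_\ell,A_0]\dots]]\,x^k$,

where the k-th term contains k nested commutators. In particular the m-fold commutator $[F_\ell,[F_\ell,\dots[F_\ell,A_0]\dots]]$ is a nilpotent matrix.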


The  last  assertion  of  the  theorem  follows  from  the  result  below. 


Lemma 8.22. Let A(x) be a nonconstant matrix of the form (8.1). If A(x) has a constant spectrum then $A_m$ is a nilpotent matrix.

Proof. As $\mathrm{tr}(A^k(x))$ is constant we must have $\mathrm{tr}(A_m^k) = 0$, $k = 1,2,\dots$. So $A_m$ is nilpotent. ■

Theorem 8.23. Let $A \in S_n$ have pairwise distinct eigenvalues. Then there exists $0 \ne B \in S_n$ such that A+xB has a constant spectrum if and only if there exists $F \in A_n$ such that

(8.24)  $[F,[F,A]] = 0$,  $[F,A] \ne 0$.

In particular A + x[F,A] has a constant spectrum, whence [F,A] is nilpotent.

Proof. Let A(x) = A+xB. According to Theorem 8.14 the equality (8.13) holds
for some $U(x) \in O_n(C[x])$. As $B \ne 0$, U(x) is not a constant matrix. Let
F(x) be given by (8.2). Then $0 \ne F(x) \in A_n(C[x])$. So F(x) is of the form
(8.5) and its leading coefficient $F_p$ is a nonzero skew symmetric matrix. In view of Theorem 8.16
$F = F_p$ satisfies the equality $[F,[F,A]] = 0$. Consider the matrix (8.21).
The equalities (8.19) yield

(8.25)  A(x) = A + x[F,A].

According to Theorem 8.20 A(x) has a constant spectrum. Lemma 8.22 implies
that [F,A] is nilpotent. It is left to show that $[F,A] \ne 0$. Suppose
that [F,A] = 0. That is, F and A commute. Since A has n distinct
eigenvalues, F = p(A) for some polynomial $p(\lambda)$. So F is symmetric.
As F is also skew symmetric, F = 0, which is a contradiction. ■

Thus to prove Theorem 7.15 for n = 3,4 we have to show that the only
skew symmetric solution to $[F,[F,A]] = 0$ is F = 0 when A is a symmetric matrix
with pairwise distinct eigenvalues. To prove Theorem 7.15 for $n \ge 5$ it is
enough to find a nontrivial solution to (8.24) with the above restriction on
F and A.

9.  The equation [F,[F,A]] = 0.

We first recall some known facts about complex skew symmetric matrices.
See for example Gantmacher [1959, Ch. 11].

First, if A and B are similar skew symmetric matrices then A and
B are orthogonally similar. Second, if zero is an eigenvalue of a skew
symmetric matrix F then in the system of elementary divisors of F all
those of even degree corresponding to the eigenvalue zero are repeated an even
number of times. Thus if F is a nonzero 3×3 skew symmetric nilpotent
matrix then F is orthogonally similar to the matrix H given in (8.9).

If F is a nonzero 4×4 skew symmetric nilpotent matrix then either F is
orthogonally similar to E given by (8.9) or F satisfies

(9.1)  $F^2 = 0$,  ker(F) = Range(F).

Theorem 9.2. Let $0 \ne F \in A_n$ and $A \in S_n$. Assume that F and [F,A] are
nilpotent and commute. Then for n = 3,4 A has at least one multiple
eigenvalue.

Proof .  We  break  our  proof  into  three  cases. 

(i) n = 3. Then we may assume that F = H and H is of the form (8.9).
Since H and [H,A] commute and are nilpotent they have a common eigenvector u

  Hu = 0 = [H,A]u = HAu.

Since the eigensubspace of H is spanned by u we must have $Au = \lambda u$.
Also $u^tu = 0$. If $\lambda$ was a simple eigenvalue (i.e. a simple zero of $|xI-A| = 0$)
then the symmetry of A yields that $u^tu \ne 0$. So $\lambda$ is a multiple
eigenvalue.

(ii) Let n = 4 and assume that (9.1) holds. Then the assumption that
$[F,[F,A]] = 0$ implies FAF = 0. So $\ker(F) \supseteq A\,\mathrm{Range}(F) = A\ker(F)$. If all
the eigenvalues of A are simple then $C^4$ splits

  $C^4 = \ker(F) \oplus W$,

i.e. W is an orthogonal complement of ker(F). So $FW \subset W$. Note that dim W
= dim ker(F) = 2. Also the eigenvectors of A form an orthogonal
basis of $C^4$. In this basis F is represented by $F_1 \oplus F_2$ where each $F_i$
is a 2×2 skew symmetric matrix. Since F is nilpotent $F_i = 0$ so F = 0.
This contradicts the assumption that $F \ne 0$. Whence A has a multiple
eigenvalue.

(iii) Let n = 4 and assume

  $F = \mathrm{diag}\{H,0\}$,  $A = \begin{pmatrix} B & \beta \\ \beta^t & a \end{pmatrix}$,

where H is given by (8.9) and $B \in S_3$.

A straightforward calculation shows that

  $[F,[F,A]] = \begin{pmatrix} [H,[H,B]] & H^2\beta \\ \beta^tH^2 & 0 \end{pmatrix}$.

So

  $[H,[H,B]] = 0$,  $\beta^tH^2 = 0$.

According to Theorem 8.20 [H,B] is a nilpotent matrix. Then by
part (i) of this proof $Bu = \lambda u$ where u is the eigenvector of H. Since
the range of $H^2$ is spanned by u the equality $\beta^tH^2 = 0$ implies that
$\beta^tu = 0$. So

  $Av = \lambda v$,  $v^t = (u^t,0)$,  $v^tv = u^tu = 0$.

Thus $\lambda$ is a multiple eigenvalue of A. ■

Theorem  8.23  and  the  above  theorem  imply  Theorem  7.15  for  n  =  3,4.  So 
we  get 
Theorem 9.3. Let $p(\lambda,x)$ be a nondegenerate polynomial of the form (5.1)
with r = 1. Let $2 \le n \le 4$. Then there exists a symmetric pencil A+xB
whose characteristic polynomial is $p(\lambda,x)$. The number of such
non-orthogonally similar pairs is at most

(9.4)  $N(n) = \prod_{i=2}^{n} i!\,/\,2^{n-1}$

and for almost all $p(\lambda,x)$ the number of distinct orbits of symmetric pairs
is exactly N(n).

Proof.  By  making  a  change  of  variables  x  =  x-j  +  y  and  considering 

$Q(A+xB)Q^t$, $Q \in O_n$, we may assume that $p(\lambda,0)$ is of the form (7.12) and
$A = D = \mathrm{diag}\{d_1,\ldots,d_n\}$, B = E. For $n \le 4$ we showed that the system
(7.14) has the unique solution E = 0. By Theorem 3.20 the map (7.13) is a
$\prod_{i=2}^{n} i!$ covering of $C^{n(n+1)/2}$. Thus there exist "generically" $\prod_{i=2}^{n} i!$
distinct symmetric E such that

(9.5)  $|\lambda I-(D+xE)| = p(\lambda,x)$.

Obviously for most $p(\lambda,x)$ of the form (5.1) satisfying $p(\lambda,0) = |\lambda I-D|$
all the corresponding E have non-zero entries. Also D+xE and D+xPEP,
$P \in DO_n$, have the same characteristic polynomial. As E has nonzero entries
then $P_1EP_1 = P_2EP_2$, $P_i \in DO_n$, if and only if $P_1 = \pm P_2$. So each $2^{n-1}$
distinct E satisfying (9.5) belong to the same orbit. Thus, generically,
there are N(n) distinct orbits and we established the theorem.

For  n=2  we  obtain  that  if  p(X,x)  is  non-degenerate  then  there  exists 
only  one  orbit  of  symmetric  pairs  corresponding  to  p(X,x).  This  result  was 
already  obtained  in  Section  7.  For  n=3,4  we  see  that  in  general  to  a  given 
p(X,x)  of  the  form  (5.1)  with  r  =  1  correspond  more  than  one  orbit.  In 
fact,  it  is  possible  to  generalize  Theorem  9.3  as  follows. 
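
For orientation, the bound N(n) of (9.4) is easy to tabulate. The short sketch below uses hypothetical function names; it evaluates the covering degree, the product of i! for i = 2,...,n, and divides by $2^{n-1}$.

```python
from math import factorial, prod

def covering_degree(n):
    """Product of i! for i = 2..n, the generic number of symmetric E in (9.5)."""
    return prod(factorial(i) for i in range(2, n + 1))

def N(n):
    """Bound (9.4): covering degree divided by 2^(n-1)."""
    return covering_degree(n) // 2 ** (n - 1)

for n in (2, 3, 4):
    print(n, covering_degree(n), N(n))
```

The value for n = 2 is a single orbit, consistent with the remark above, while n = 3 and n = 4 already give several orbits.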
Theorem 9.6. Let $D = \mathrm{diag}\{d_1,\ldots,d_n\}$ be a diagonal matrix with pairwise
distinct diagonal entries. Let $p(\lambda,x)$ be a polynomial of the form (5.1)
with r = 1 such that $p(\lambda,0) = |\lambda I-D|$. Assume that $2 \le n \le 4$. Let $B \in S_n$
be given. Then there exists $E \in S_n$ such that

(9.7)  $|\lambda I - D - x(E+B)| = p(\lambda,x)$.

The number of such E never exceeds

(9.8)  $M(n) = \prod_{i=2}^{n} i!$

and for most of such $p(\lambda,x)$ the number of such E is exactly M(n).

Proof .  Consider  the  map 

^  :  S  -  =  P"'’ 

n 

given  by 

tfl(E)  =  (P^q(D,E+B),p^^ (D,E+B), •••,P^^{U,E+B)). 

So 

l(>^(E)  =  (P^g(D,E),  •••,P^^(D,E)  ). 

Thus  V’^(E)  =  0  implies  E  =  0  and  the  result  follows  by  Theorem  3.20. 

■ 

We  now  prove  the  second  part  of  Theorem  7.15.  By  repeating  the  arguments 
of  part  (iii)  in  the  proof  of  Theorem  9.2  we  get 

Lemma 9.9. Let $F \in A_n$ be orthogonally similar to E given by (8.9).
Assume that $A \in S_n$ and $[F,[F,A]] = 0$. Then A has a multiple eigenvalue.

Thus in case that $n \ge 5$ we shall take E to be orthogonally similar
to F of the form

(9.10)  F  =  diag(H,0),  H  =  ^),  P  =  [| 

More precisely using orthogonal similarity we may assume that $A = A_1 \oplus A_2$
where $A_1 \in S_5$ and F is of the form (9.10). Thus to prove Theorem
7.15 for $n \ge 5$ it is enough to consider n = 5. Choose

  $A = \begin{pmatrix} B & \beta \\ \beta^t & a \end{pmatrix}$

where

  $B = \mathrm{diag}\{d_1I, d_2I\}$

and I is the 2×2 identity matrix. So

  $[F,[F,A]] = \begin{pmatrix} [H,[H,B]] & H^2\beta \\ \beta^tH^2 & 0 \end{pmatrix}$.

Clearly

  $[H,[H,B]] = 0$.

Choose $\beta^t = (0,b,0,c)$. Then $|\lambda I-A| = (\lambda-d_1)(\lambda-d_2)|\lambda I-C|$, where

  $C = \begin{pmatrix} d_1 & 0 & b \\ 0 & d_2 & c \\ b & c & a \end{pmatrix}$.

So

  $|\lambda I-C| = \lambda^3 - t_1\lambda^2 + t_2\lambda - t_3$,
  $t_1 = d_1 + d_2 + a$,
  $t_2 = d_1d_2 + (d_1+d_2)a - b^2 - c^2$,
  $t_3 = d_1d_2a - d_2b^2 - d_1c^2$.

Since $d_1 \ne d_2$, $t_1$, $t_2$ and $t_3$ determine uniquely a, $b^2$ and $c^2$. Thus we
can choose a, b, c such that

  $|\lambda I-C| = (\lambda-d_3)(\lambda-d_4)(\lambda-d_5)$,

where $d_i \ne d_j$ for $i \ne j$. This proves the second part of Theorem 7.15.
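
The last step, recovering a, $b^2$ and $c^2$ from $t_1$, $t_2$, $t_3$, amounts to solving a linear system whose determinant is $d_1 - d_2$. The following sketch, with hypothetical function names and sample values chosen only for illustration, carries this out and checks that the prescribed $d_3, d_4, d_5$ are roots of $|\lambda I - C|$.

```python
import cmath

def solve_abc(d1, d2, d3, d4, d5):
    """Return (a, b, c) so that C = [[d1,0,b],[0,d2,c],[b,c,a]] has
    eigenvalues d3, d4, d5. Requires d1 != d2."""
    t1 = d3 + d4 + d5
    t2 = d3 * d4 + d3 * d5 + d4 * d5
    t3 = d3 * d4 * d5
    a = t1 - d1 - d2
    s = d1 * d2 + (d1 + d2) * a - t2      # equals b^2 + c^2
    r = d1 * d2 * a - t3                  # equals d2*b^2 + d1*c^2
    b2 = (r - d1 * s) / (d2 - d1)
    c2 = s - b2
    return a, cmath.sqrt(b2), cmath.sqrt(c2)

def char_poly_C(lam, d1, d2, a, b, c):
    """|lam*I - C| expanded along the first row."""
    return (lam - d1) * ((lam - d2) * (lam - a) - c * c) - b * b * (lam - d2)

# Sample data (illustrative only).
d1, d2, d3, d4, d5 = 1, 2, 3, 4, 5
a, b, c = solve_abc(d1, d2, d3, d4, d5)
residuals = [abs(char_poly_C(lam, d1, d2, a, b, c)) for lam in (d3, d4, d5)]
```

Note that b and c are in general complex, which is harmless since A is only required to be complex symmetric.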
10.  Symmetric Polynomial Matrices

Let $A(x), B(x) \in S_n(C[x])$. That is

(10.1)  $A(x) = \sum_{j=0}^{m} A_jx^j$,  $B(x) = \sum_{j=0}^{m} B_jx^j$,  $A_j, B_j \in S_n$,  j = 0,...,m.

The matrices A(x) and B(x) are called orthogonally similar if

(10.2)  $B(x) = U(x)A(x)U^t(x)$,

and $U(x) \in O_n(C[x])$. If U(x) can be chosen to be constant, i.e. U(x) = U
$\in O_n$, then A(x) and B(x) are said to be strictly orthogonally similar.
That is, the matrices $\{A_j\}$ and $\{B_j\}$ are simultaneously orthogonally
similar

(10.3)  $B_j = UA_jU^t$,  j = 0,...,m.

Clearly for $n \ge 3$ there are symmetric matrices A(x) and B(x) which are
orthogonally similar but not strictly orthogonally similar. Indeed, choose
$A(x) = A_0$ and $U(x) \in O_n(C[x])$ such that B(x) given by (10.2) is a
nonconstant matrix. So $B_m \ne 0$ and $A_m = 0$ for some $m \ge 1$. Obviously $A_0$
and B(x) are not strictly orthogonally similar. Consider the characteristic
polynomial of A(x). It is easy to see that this polynomial is of the form

(10.4)  $|\lambda I-A(x)| = \lambda^n + \sum_{i=1}^{n} p_i(x)\lambda^{n-i}$,  $p_i(x) = \sum_{j=0}^{im} p_{ij}x^j$,  i = 1,2,...,n.

In particular

(10.5)  $|\lambda I-A_m| = \lambda^n + \sum_{i=1}^{n} p_{i,im}\lambda^{n-i}$.

If A(x) and B(x) are orthogonally similar (or even similar over
$GL_n(C[x])$) then A(x) and B(x) have the same characteristic polynomial
$p(\lambda,x)$. We now give a simple condition on $p(\lambda,x)$ which ensures strict
orthogonal similarity of A(x) and B(x) provided that A(x) and B(x) are
orthogonally similar.
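
To make the remark about $n \ge 3$ concrete, here is a minimal sketch of a nonconstant polynomial orthogonal matrix. It assumes a particular 3x3 skew symmetric nilpotent matrix `H` (chosen for illustration, not taken from the text), for which $U(x) = I + xH + x^2H^2/2$ is the exponential $e^{Hx}$ and satisfies $U(x)U^t(x) = I$. Helper names are hypothetical.

```python
def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(X):
    return [list(row) for row in zip(*X)]

H = [[0, 0, 1], [0, 0, 1j], [-1, -1j, 0]]   # H^t = -H and H^3 = 0
H2 = matmul(H, H)
I3 = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

def U(x):
    """U(x) = I + x H + (x^2/2) H^2, a polynomial orthogonal matrix."""
    return [[I3[i][j] + x * H[i][j] + x * x * H2[i][j] / 2 for j in range(3)]
            for i in range(3)]

def B(x, A0):
    """B(x) = U(x) A0 U(x)^t for a constant symmetric A0."""
    return matmul(matmul(U(x), A0), transpose(U(x)))

A0 = [[0, 0, 0], [0, 0, 0], [0, 0, 1]]
```

With this choice B(x) is symmetric and depends on x, so it is orthogonally similar to the constant matrix `A0` without being strictly orthogonally similar to it.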
Theorem 10.6. Let $A(x), B(x) \in S_n(C[x])$ be of the form (10.1). Assume that
(10.2) holds for $U(x) \in O_n(C[x])$. Let $p(\lambda,x)$ of the form (10.4) be the
characteristic polynomial of A(x). Assume that $A_m$ has pairwise distinct
eigenvalues. Then U(x) is a constant matrix. That is, A(x) and B(x) are
strictly orthogonally similar.

To prove the theorem we need the following lemmas.

Lemma 10.7. Let $A,B \in S_n$ have the same pairwise distinct eigenvalues. Then
there exist $2^n$ distinct orthogonal matrices U such that $B = UAU^t$.

Proof. Since A and B are orthogonally similar to the same diagonal
matrix D it is enough to consider the case A = B = D. But then $D = UDU^t$ if
and only if $U = \mathrm{diag}\{\pm1,\ldots,\pm1\}$ and the lemma is proved.

Let $\Omega$ be a domain in C. Denote by $H(\Omega)$ the set of analytic
functions in $\Omega$. Then $S_n(H(\Omega))$ and $O_n(H(\Omega))$ will denote the sets of
symmetric and orthogonal matrices A(x) and U(x) for $x \in \Omega$ such that the
entries of A(x) and U(x) are analytic functions in $\Omega$.

Lemma 10.8. Let $\Omega$ be a simply connected domain in C. Assume that A(x),
$B(x) \in S_n(H(\Omega))$. Suppose that for each $x \in \Omega$, A(x) and B(x) have the
same pairwise distinct eigenvalues. Let

(10.9)  $B(x_0) = U_0A(x_0)U_0^t$,  $U_0 \in O_n$,

for some $x_0 \in \Omega$. Then there exists a unique $U(x) \in O_n(H(\Omega))$ satisfying
(10.2) such that $U(x_0) = U_0$.

Proof. First we note that any $U(x) \in O_n(H(\Omega))$ satisfying (10.2) is a
solution of the linear and quadratic equations

  $B(x)U(x) - U(x)A(x) = 0$,  $U(x)U^t(x) = I$.

At each point x we have exactly $2^n$ distinct solutions. The implicit
function theorem implies that for any given $x_1 \in \Omega$ there exists $r_1 = r(x_1) > 0$
such that the above system has $2^n$ distinct analytic solutions U(x) in
the disc $|x-x_1| < r_1$. So U(x) can be continued analytically on any
continuous curve $\Gamma \subset \Omega$. Thus, in general, we have a multivalued (with at
most $2^n$ branches) analytic orthogonal matrix valued function U(x)
satisfying (10.2) with $U(x_0) = U_0$ at least for some branch of U(x). The
assumption that $\Omega$ is simply connected implies that U(x) is univalued,
i.e. $U(x) \in O_n(H(\Omega))$.


The assumption that $\Omega$ is simply connected is crucial to the proof of
Lemma 10.8. Indeed let

  $A(x) = B(x) = \begin{pmatrix} 1+x & i \\ i & -1-x \end{pmatrix}$.

Then in the domain

  $\Omega = \{x,\ 0 < |x| < 2\}$

A(x) has two distinct eigenvalues. Clearly

  $A(-1) = U_0A(-1)U_0^t$,  $U_0 = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}$.

Then the solution U(x) satisfying (10.2) with B(x) = A(x) and the
condition $U(-1) = U_0$ is of the form

  $U(x) = A(x)/\sqrt{x(2+x)}$,  $\sqrt{-1} = i$.

Now U(x) is two valued in $\Omega$ since $\sqrt{x(2+x)}$ is two valued in $\Omega$. On
the other hand if we choose $U_0 = I$ then U(x) = I and this solution is
clearly single valued in $\Omega$. This shows that the choice of $U_0$ is important.
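
The two-valuedness in this example can be observed numerically. The sketch below is illustrative only (function names are hypothetical): it follows the continuous branch of $\sqrt{x(2+x)}$ along the unit circle starting and ending at x = -1, taking the off-diagonal entries of A(x) to be i as in the displayed matrix, and checks that $A(x)^2 = x(2+x)I$, which makes $U(x) = A(x)/\sqrt{x(2+x)}$ orthogonal.

```python
import cmath

def A(x):
    return [[1 + x, 1j], [1j, -1 - x]]

def continue_sqrt(path, s0):
    """Follow the branch of sqrt(x(2+x)) along `path`, starting from the
    value s0, by always picking the root closest to the previous value."""
    s = s0
    for x in path:
        r = cmath.sqrt(x * (2 + x))
        s = r if abs(r - s) <= abs(r + s) else -r
    return s

# Closed loop around x = 0 that starts and ends at x = -1.
steps = 2000
loop = [-cmath.exp(2j * cmath.pi * k / steps) for k in range(steps + 1)]
s_start = 1j                       # sqrt(-1) = i, so that U(-1) = U_0
s_end = continue_sqrt(loop, s_start)

def A_squared(x):
    M = A(x)
    return [[sum(M[i][k] * M[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]
```

After one turn around the origin the square root, and hence U(x), changes sign, so the continuation of U(x) from $U_0$ does not close up on the punctured disc.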
Proof of Theorem 10.6. Let $|x| > r$. Then by dividing (10.2) by $x^m$ we get

  $C(x^{-1}) = B_m + \sum_{i=1}^{m} B_{m-i}x^{-i} = U(x)\left(A_m + \sum_{i=1}^{m} A_{m-i}x^{-i}\right)U^t(x)$.

Put $y = x^{-1}$. For $|y| < \varepsilon$ the eigenvalues of C(y) would be close to the
eigenvalues of $B_m$ and therefore will be pairwise distinct. Thus we can
apply Lemma 10.8 for $y^mA(y^{-1})$ and $y^mB(y^{-1})$. Hence U(x) is analytic in
the neighborhood of $x = \infty$. Now Liouville's theorem implies that U(x)
must be constant. ■

Remark 10.10. The results of Theorem 10.6 apply if we assume that U(x)
$\in O_n(H(C))$, i.e. the entries of U(x) are entire functions.

Let A(x) be of the form (10.1). Suppose that A(x) is non-degenerate.
That is, $|\lambda I-A(x)|$ is a non-degenerate polynomial. Hence D(x) given by
(4.17) is a polynomial of degree $\le n(n-1)m$. Assume furthermore that $A_m$
has pairwise distinct eigenvalues. Then D(x) is a polynomial of degree
n(n-1)m. Let $\zeta_1,\ldots,\zeta_s$ be all intersection points of $|\lambda I-A(x)|$. So
$s \le n(n-1)m$. Let B(x) be also of the form (10.1) and suppose that

(10.11)  $|\lambda I-A(x)| = |\lambda I-B(x)|$.

Pick $x_0 \ne \zeta_j$, j = 1,...,s. Let $U_0$ satisfy (10.9). Then, according to the
proof of Lemma 10.8 there exists an analytic multivalued orthogonal matrix
function $U(x,U_0)$ on the Riemann sphere (i.e. $C \cup \{\infty\}$) punctured at $\zeta_j$,
j = 1,...,s, satisfying (10.2) such that

  $U(x_0,U_0) = U_0$

for at least one branch of $U(x,U_0)$. If it happens that $U(x,U_0)$ is single
valued and the points $\zeta_j$, j = 1,...,s, are removable singularities then
Liouville's theorem implies that $U(x,U_0) = U_0$ and A(x) and B(x) are
strictly orthogonally similar. Note, as we pointed out before, sometimes this
can happen only for special choices of $U_0$ out of $2^n$ possible choices. In
what follows we give a simple condition on $U_0$ which will ensure the needed
properties of $U(x,U_0)$.

Lemma 10.12. Let A(x), B(x) be of the form (10.1). Assume that A(x) and
B(x) have the same characteristic polynomial and suppose that $A_m$ has
pairwise distinct eigenvalues. (So $|\lambda I-A(x)|$ is non-degenerate.) Let
$\zeta_1,\ldots,\zeta_s$ be the intersection points of $|\lambda I-A(x)|$. Choose $x_0 \ne \zeta_j$,
j = 1,...,s such that the real line $(1-t)x_0 + t\zeta_j$, t real, contains only one
intersection point $\zeta_j$. Let $U_j(x)$ be the analytic solution of the equation
(10.2) along the open segment $(1-t)x_0 + t\zeta_j$, 0 < t < 1, with the initial
condition $U_j(x_0) = U_0$ for a fixed choice of $U_0$ satisfying (10.9). Assume
that $U_j(x)$ can be continued analytically to a disc $|x-\zeta_j| < r$ for some
r > 0 and j = 1,...,s. Then $U_j(x) = U_0$, j = 1,...,s, hence A(x) and
B(x) are strictly orthogonally similar.

Proof. Let $\Omega$ be the complex plane cut along the rays $(1-t)x_0 + t\zeta_j$,
$1 \le t$, j = 1,...,s. So $\Omega$ is simply connected and $\zeta_j \notin \Omega$ for
j = 1,...,s. Thus, there exists a unique $U(x) \in O_n(H(\Omega))$ satisfying (10.2)
and the condition $U(x_0) = U_0$. Let

  $\hat\Omega_1 = \Omega \cup \{(1-t)x_0 + t\zeta_1,\ t > 1\}$.

Note that we can continue U(x) analytically along any closed
curve $\Gamma$ in $\hat\Omega_1$. We claim that U(x) is single valued in $\hat\Omega_1$. Indeed if
$\Gamma$ is homotopic to a point in $\hat\Omega_1$ then of course U(x) is single valued
along $\Gamma$. It is left to examine the case where $\Gamma$ is homotopic to a curve
circling k times around the point $\zeta_1$. Since $U_1(x)$ can be continued
analytically in the neighborhood of $\zeta_1$ we see that U(x) remains single
valued on $\Gamma$ in this case too. Finally, the assumption that $U_1(x)$ can be
continued analytically in the neighborhood of $\zeta_1$ implies that U(x) is
single valued and analytic in $\Omega_1 = \hat\Omega_1 \cup \{\zeta_1\}$. Continuing in the same manner
we deduce that U(x) can be extended analytically to the whole complex
plane. As $A_m$ has pairwise distinct eigenvalues we get that the polynomial given by
(10.5) has simple roots. Now Remark 10.10 implies that $U(x) = U_0$. ■

Let A(x), B(x) be of the form (10.1). Suppose that A(x) and B(x)
have the same characteristic polynomial which is non-degenerate. In order to
be able to apply the above lemma we have to ensure that

(i) A(x) and B(x) are analytically similar in the neighborhood of each
intersection point $\zeta$. That is

(10.13)  $B(x) = U(x)A(x)U(x)^{-1}$,  $U(x) \in GL_n(H(D))$.

Here D is some disc

(10.14)  $D(\zeta,r) = \{x,\ |x-\zeta| < r\}$

and $GL_n(H(\Omega))$ is the set of invertible matrices U(x) such that the
entries of U(x) and $U^{-1}(x)$ are analytic functions in $\Omega$.

(ii) U(x) can be chosen to be an orthogonal matrix.

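The question of local (analytic) similarity was studied by us in
Friedland [1980]. The first step is to bring A(x) to a block diagonal form

(10.15)  $V^{-1}(x)A(x)V(x) = \sum_{j=1}^{\ell} \oplus A_j(x)$,  $V(x) \in GL_n(H(D(\zeta,r)))$,

such that $A_j(\zeta)$ has one eigenvalue $\lambda_j$, $j = 1,\ldots,\ell$, and $\lambda_j \ne \lambda_k$ for
$j \ne k$. Here r is some positive number. Also

(10.16)  $W^{-1}(x)B(x)W(x) = \sum_{j=1}^{\ell} \oplus B_j(x)$,  $W(x) \in GL_n(H(D(\zeta,r)))$,

where $A_j(x)$ and $B_j(x)$ correspond to the same eigenvalue $\lambda_j$ for $j = 1,\ldots,\ell$.
So A(x) and B(x) are locally similar if and only if $A_j(x)$ and $B_j(x)$
are locally similar for each j. Clearly if $A_j(x)$ and $B_j(x)$ are 1×1 matrices
and $A_j(x)$ and $B_j(x)$ have the same characteristic polynomial then
$A_j(x)$ and $B_j(x)$ are equal.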

In case that $A_j(x)$ and $B_j(x)$ are not 1×1 matrices it may well
happen that $A_j(x)$ and $B_j(x)$ have the same characteristic polynomial but
$A_j(x)$ and $B_j(x)$ are not locally similar. A simple criterion due to Wasow
[1963] gives an additional condition on $A_j(\zeta)$ and $B_j(\zeta)$ which ensures the
local similarity of $A_j(x)$ and $B_j(x)$.

Theorem 10.17. Let $A(x), B(x) \in M_n(H(D(\zeta,r)))$. Assume that A(x) and
B(x) have the same characteristic polynomial. If the minimal polynomial of
$A(\zeta)$ and $B(\zeta)$ is equal to its characteristic polynomial then A(x) and
B(x) are locally similar.
Let $A(x) \in M_2(H(D))$ for some disc (10.14). The interesting case from
our point of view is when $A(\zeta)$ has a double eigenvalue $\lambda_0$. In the
neighborhood of $\zeta$ the eigenvalues of A(x) behave as an algebraic function
of x, i.e. they must have a Puiseux expansion (4.5) with m = 2. For a
certain Puiseux expansion of $\lambda(x)$ it is possible to tell when the minimal
polynomial of $A(\zeta)$ is $(\lambda-\lambda_0)^2$.

Lemma 10.18. Let $A(x) \in M_2(H(D))$, $D = D(\zeta,r)$. Assume that the eigenvalues
of A(x) have the Puiseux expansion (4.5) with m = 2 in the neighborhood
of $\zeta$. If the coefficient of $(x-\zeta)^{1/2}$ in this expansion is nonzero then the
minimal polynomial of $A(\zeta)$ is $(\lambda-\lambda_0)^2$.

Proof. Assume to the contrary that $A(\zeta) = \lambda_0I$. So

(10.19)  $A(x) = \lambda_0I + (x-\zeta)B(x)$,  $B(x) \in M_2(H(D))$.

Since the eigenvalues of B(x) have a Puiseux expansion we deduce that
the coefficient of $(x-\zeta)^{1/2}$ vanishes, contrary to our assumptions. ■

Let A(x) be of the form (10.1) and suppose that $A_m$ has pairwise
distinct eigenvalues. So $|\lambda I-A(x)|$ is non-degenerate and the discriminant
D(x) given by (4.17) is a polynomial of degree n(n-1)m. Let

(10.20)  $\delta(A_0,\ldots,A_m) \equiv \delta(|\lambda I-A(x)|)$

be the discriminant of D(x) given by (5.9) (r = m).

Suppose that

(10.21)  $\delta(A_0,\ldots,A_m) \ne 0$.

Then at each intersection point $\zeta$, $A(\zeta)$ has exactly one double eigenvalue.
Moreover the Puiseux expansions of those two eigenvalues satisfy the
assumptions of Lemma 10.18 (Theorem 4.22). So if B(x) is of the form (10.1) and
$|\lambda I-A(x)| = |\lambda I-B(x)|$ then A(x) and B(x) are locally similar in the
neighborhood of any finite or infinite point $\zeta$ if the condition (10.21)
holds. Our next step is to show that the similarity matrix in (10.13) can be
chosen to be orthogonal. This is implied by the following two lemmas.
Lemma 10.22. Let $A(x) \in S_n(H(D(\zeta,r')))$. Suppose that $A(\zeta)$ has m distinct
eigenvalues $\lambda_1,\ldots,\lambda_m$, where $n_j$ is the multiplicity of $\lambda_j$. Then there
exists $V(x) \in O_n(H(D(\zeta,r)))$ satisfying (10.15) where $A_j(\zeta)$ has one
eigenvalue $\lambda_j$, for some r > 0.

Proof. By considering the matrix $QA(\zeta)Q^t$, $Q \in O_n$, we may assume that

  $A(\zeta) = \sum_{j=1}^{m} \oplus A_j(\zeta)$,

where each $A_j(\zeta)$ has one eigenvalue $\lambda_j$.

Choose a positive $\rho$ such that

  $D(\lambda_i,\rho) \cap D(\lambda_j,\rho) = \emptyset$ for $i \ne j$.

Let

  $E_j(x) = \frac{1}{2\pi i}\int_{|\lambda-\lambda_j|=\rho} (\lambda I-A(x))^{-1}\,d\lambda$.

Then

  $E_j^t(x) = E_j(x) = E_j^2(x)$,  $E_j(x)E_k(x) = 0$ for $k \ne j$,  rank $E_j(x) = n_j$,  j = 1,...,m,

for $|x-\zeta| < r''$ for some positive $r''$. Let

  $i_j = n_0+\cdots+n_{j-1}$,  j = 1,...,m+1,  $n_0 = 0$,  $u_\alpha = (\delta_{\alpha\beta})_{\beta=1}^{n}$.

Consider

  $u_\alpha(x) = E_j(x)u_\alpha$,  $\alpha = i_j+1,\ldots,i_{j+1}$.

Note that $u_\alpha(\zeta) = u_\alpha$, $\alpha = 1,\ldots,n$. Apply the Gram-Schmidt process to
$u_{i_j+1}(x),\ldots,u_{i_{j+1}}(x)$ to obtain analytic vectors $v_{i_j+1}(x),\ldots,v_{i_{j+1}}(x)$ in
the neighborhood of $\zeta$ such that

  $v_\alpha^t(x)v_\beta(x) = \delta_{\alpha\beta}$,  $i_j < \alpha,\beta \le i_{j+1}$,  j = 1,...,m.

Since $E_j(x)E_k(x) = 0$ for $j \ne k$ we get $v_\alpha^t(x)v_\beta(x) = \delta_{\alpha\beta}$, $1 \le \alpha,\beta \le n$. So

  $V(x) = (v_1(x),\ldots,v_n(x)) \in O_n(H(D(\zeta,r)))$

and the equality (10.15) holds.
Lemma 10.23. Let $A(x) \in S_2(H(D(\zeta,r')))$. Then there exists $U(x) \in
O_2(H(D(\zeta,r)))$ for some r > 0 such that $U(x)A(x)U^t(x)$ has one of the
following forms

(i)  $\begin{pmatrix} a(x) & 0 \\ 0 & b(x) \end{pmatrix}$,

(ii)  $\begin{pmatrix} 1+a(x) & b(x) \\ b(x) & -1+a(x) \end{pmatrix}$,  $b(\zeta) = i$,

(iii)  $\left(\sum_{j=0}^{k} a_j(x-\zeta)^j\right)\begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} + (x-\zeta)^{k+1}\begin{pmatrix} 1+a(x) & b(x) \\ b(x) & -1+a(x) \end{pmatrix}$,  $b(\zeta) = i$,  $k \ge 0$.

In all the cases the functions a(x), b(x) and the values $a_j$ are
determined by the characteristic polynomial of A(x). If the eigenvalues of
A(x) are not identical then the value of k in (iii) is bounded from above.
Proof. Suppose first that $A(\zeta)$ has two distinct eigenvalues. Then A(x)
is orthogonally similar to the matrix of the form (i) by virtue of Lemma
10.22. Assume now that $A(\zeta)$ has a double eigenvalue. By considering the
matrix

  $A(x) - \tfrac{1}{2}\mathrm{tr}(A(x))I$

we may assume that A(x) is of the form

  $A(x) = \begin{pmatrix} \alpha(x) & \beta(x) \\ \beta(x) & -\alpha(x) \end{pmatrix}$.

Suppose that the minimal polynomial of $A(\zeta)$ is $\lambda^2$. Using the
transformation $QA(x)Q^t$, $Q \in O_2$, we may assume $\alpha(\zeta) = 1$, $\beta(\zeta) = i$. Then
choose in the form (ii)

  $b = \beta\sqrt{1 + (\alpha^2-1)\beta^{-2}}$

and let

  $U(x) = \begin{pmatrix} u(x) & -v(x) \\ v(x) & u(x) \end{pmatrix}$,  $v = u(\alpha-1)(\beta+b)^{-1}$,  $u = 1/\sqrt{1 + (\alpha-1)^2(\beta+b)^{-2}}$.

Then $U(x)A(x)U^t(x)$ will be of the form (ii) with a(x) = 0. Assume
now that the minimal polynomial of $A(\zeta)$ is $\lambda$. Then $A(\zeta) = 0$ and

  $A(x) = (x-\zeta)B(x)$,  $B(x) \in S_2(H(D(\zeta,r)))$.

If the minimal polynomial of $B(\zeta)$ is $\lambda^2$ then B(x) is of the form (ii)
and we achieved the form (iii). If $B(\zeta) = 0$ then we continue in the same
manner. If we stop the process at the step k we shall achieve the form
(iii). If the above process never stops then we deduce that A(x) is of the
form (i) and a(x) = b(x). It is easy to see that a(x), b(x), $a_j$ are
determined by tr(A(x)) and |A(x)|. Suppose we have the form (iii). Then
the Puiseux expansions of both eigenvalues of A(x) are of the form

  $\lambda_h(x) = \sum_{j=0}^{k} a_j(x-\zeta)^j + |x-\zeta|^{k+1}O(1)$,  h = 1,2.

Thus if $\lambda_1(x) \not\equiv \lambda_2(x)$, k has an upper bound. ■
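
The reduction to the form (ii) in the proof above can be checked numerically at a single point. The sketch below is a hedged illustration with hypothetical function names. It assumes the rotation $U = \begin{pmatrix} u & -v \\ v & u \end{pmatrix}$ with $v/u = (\alpha-1)/(\beta+b)$, which is the choice that stays regular at $\alpha = 1$, $\beta = b = i$; the sample values of $\alpha$, $\beta$ are arbitrary numbers near (1, i).

```python
import cmath

def reduce_to_form_ii(alpha, beta):
    """For A = [[alpha, beta], [beta, -alpha]] return (b, U A U^t), where
    b^2 = alpha^2 + beta^2 - 1 and U is the rotation [[u, -v], [v, u]]."""
    b = beta * cmath.sqrt(1 + (alpha * alpha - 1) / (beta * beta))
    t = (alpha - 1) / (beta + b)          # t = v / u
    u = 1 / cmath.sqrt(1 + t * t)
    v = u * t
    A = [[alpha, beta], [beta, -alpha]]
    U = [[u, -v], [v, u]]
    Ut = [[u, v], [-v, u]]
    UA = [[sum(U[i][k] * A[k][j] for k in range(2)) for j in range(2)]
          for i in range(2)]
    UAUt = [[sum(UA[i][k] * Ut[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]
    return b, UAUt

# Sample point near alpha = 1, beta = i (illustrative values only).
b, M = reduce_to_form_ii(1.2 + 0.1j, 0.3 + 1j)
```

The result `M` agrees with $\begin{pmatrix} 1 & b \\ b & -1 \end{pmatrix}$ up to rounding, that is, with the form (ii) with a(x) = 0.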

Let A(x) be of the form (10.1). Suppose that $\delta(A_0,\ldots,A_m) \ne 0$.
Then D(x) given by (4.17) is a polynomial of degree n(n-1)m having
n(n-1)m simple zeros $\zeta_1,\ldots,\zeta_s$, s = n(n-1)m. Thus $|\lambda I-A(x)|$ has
n(n-1)m intersection points $\zeta_1,\ldots,\zeta_s$. At each intersection point $\zeta_k$
there exists $V_k(x) \in O_n(H(D(\zeta_k,r)))$ such that (10.15) holds. Here
$\ell = n-1$, $A_1(x) \in S_2(H(D(\zeta_k,r)))$, and $A_1(x)$ is of the form (ii)

(10.24)  $A_1(x) = C_k(x) = \begin{pmatrix} 1+a_k(x) & b_k(x) \\ b_k(x) & -1+a_k(x) \end{pmatrix}$

in view of Theorem 4.22, Lemmas 10.18 and 10.23. Other $A_j(x)$ are the
analytic roots of $|\lambda I-A(x)|$ in the neighborhood of $\zeta_k$. Assume that B(x)
is of the form (10.1) and suppose that $|\lambda I-A(x)| = |\lambda I-B(x)|$. Then at the
neighborhood of each intersection point $\zeta_k$ there exists $W_k(x) \in
O_n(H(D(\zeta_k,r)))$ such that (10.16) holds and

(10.25)  $\sum_{j=1}^{n-1} \oplus A_j(x) = \sum_{j=1}^{n-1} \oplus B_j(x)$.


Choose $x_0 \in C$, $x_0 \ne \zeta_j$, j = 1,...,n(n-1)m, such that the line $(1-t)x_0
+ t\zeta_j$, $-\infty < t < \infty$, contains only one intersection point $\zeta_j$. By considering
the matrices $Q_1A(x)Q_1^t$, $Q_2B(x)Q_2^t$, $Q_j \in O_n$, j = 1,2, we assume

(10.26)  $A(x_0) = B(x_0) = D = \mathrm{diag}\{d_1,\ldots,d_n\}$.

Let $\Omega_k$ be a simply connected domain containing the segment $(1-t)x_0 +
t\zeta_k$, 0 < t < 1, and such that $\zeta_j \notin \Omega_k$, j = 1,...,s. Choose $T \in DO_n$.
Let $U_k(x,T) \in O_n(H(\Omega_k))$ be the unique solution of (10.2) satisfying
$U_k(x_0,T) = T$. Thus for $x \in \Omega_k \cap D(\zeta_k,r) = \Omega_k'$ we have

  $B(x) = W_k(x)\left(\sum_{j=1}^{\ell} \oplus A_j(x)\right)W_k^t(x) = U_k(x,T)A(x)U_k^t(x,T) = U_k(x,T)V_k(x)\left(\sum_{j=1}^{\ell} \oplus A_j(x)\right)V_k^t(x)U_k^t(x,T)$.

As $A_j(\zeta_k)$ and $A_i(\zeta_k)$ do not have a common eigenvalue for $j \ne i$ we easily
deduce

(10.27)  $W_k^t(x)U_k(x,T)V_k(x) = \sum_{j=1}^{n-1} \oplus R_j(x)$,  $R_j \in O_{n_j}(H(\Omega_k'))$.

In particular $|R_j(x)|^2 = 1$. So for $j \ge 2$ each $R_j(x)$ is a constant number
of modulus 1.

For $R_1(x)$ we have

(10.28)  $R_1(x) = Q_k(x,B,T) \in O_2(H(\Omega_k'))$,  $Q_k(x,B,T)C_k(x)Q_k^t(x,B,T) = C_k(x)$,

where $C_k(x)$ is given by (10.24).

Let

(10.29)  $\omega_k(B,T) = |Q_k(x,B,T)|$,  k = 1,...,n(n-1)m.

So $\omega_k(B,T) = \pm1$.

Lemma 10.30. Let the above assumption hold. Suppose that $\omega_k(B,T) = 1$. Then
the matrix $U_k(x,T)$ can be continued analytically to the neighborhood of $\zeta_k$.
Proof. Consider the matrix $C_k(x)$, $x \in \Omega_k'$. So $C_k(x)$ has two distinct
eigenvalues. Thus there are four orthogonal matrices commuting with $C_k(x)$.
Namely $\pm I$, $\pm Q$, where |Q| = -1. Therefore if $\omega_k(B,T) = 1$ then
$Q_k(x,B,T)$ is a constant matrix which equals $\pm I$. Hence
$\sum_{j=1}^{n-1} \oplus R_j(x)$ is constant in $\Omega_k'$ and therefore can be analytically
extended to $D(\zeta_k,r)$. Finally (10.27) shows that

  $U_k(x,T) = W_k(x)\left(\sum_{j=1}^{n-1} \oplus R_j(x)\right)V_k^t(x)$

so $U_k(x,T)$ has an analytic extension to $D(\zeta_k,r)$. ■

Theorem 10.31. Let $A(x), B(x), \tilde B(x) \in S_n(C[x])$ be of the form (10.1).
Assume that

  $|\lambda I-A(x)| = |\lambda I-B(x)| = |\lambda I-\tilde B(x)|$.

Suppose that $\delta(A_0,\ldots,A_m) \ne 0$. Let $\zeta_1,\ldots,\zeta_s$, s = n(n-1)m, be the
intersection points of $|\lambda I-A(x)|$ and suppose that the line $(1-t)x_0 + t\zeta_j$,
$-\infty < t < \infty$, contains only one intersection point $\zeta_j$ for j = 1,...,s.
Assume that $A(x_0) = B(x_0) = \tilde B(x_0)$ is a diagonal matrix. Let
$T, \tilde T \in DO_n$ and suppose that

(10.32)  $\omega_j(B,T) = \omega_j(\tilde B,\tilde T)$,  j = 1,...,n(n-1)m.

Then

(10.33)  $\tilde B(x) = \tilde TTB(x)T\tilde T$.

Proof. In the arguments preceding Lemma 10.30 replace B, U, W, R by
$\tilde B, \tilde U, \tilde W, \tilde R$ respectively. So we get

  $\tilde U_k = \tilde W_k\left(\sum_{j=1}^{n-1} \oplus \tilde R_j(x)\right)V_k^t(x)$.

Thus

  $\tilde U_kU_k^t = \tilde W_k\left(\sum_{j=1}^{n-1} \oplus \tilde R_j(x)R_j^t(x)\right)W_k^t$

and

  $\tilde R_1(x)R_1^t(x)C_k(x)R_1(x)\tilde R_1^t(x) = C_k(x)$.

As

  $|\tilde R_1(x)R_1^t(x)| = \omega_k(\tilde B,\tilde T)\omega_k(B,T) = 1$

we deduce that $\tilde R_1(x)R_1^t(x) = \pm I$, so $\tilde R_1(x)R_1^t(x)$ has an analytic continuation
in the neighborhood of $\zeta_k$. Hence the orthogonal matrix Q(x) satisfying
the equation

  $\tilde B(x) = Q(x)B(x)Q^t(x)$,  $Q(x_0) = \tilde TT$

fulfills the assumptions of Lemma 10.12. So $Q(x) = \tilde TT$ and (10.33) holds. ■
Theorem 10.34. Let $p(\lambda,x)$ be a polynomial of the form (10.4). Assume that
$\delta(p) \ne 0$. Then there are at most $\nu = 2^{(n-1)(mn-1)}$ polynomial symmetric
matrices $A_1(x),\ldots,A_\nu(x)$ of the form (10.1) such that
$|\lambda I-A_j(x)| = p(\lambda,x)$, $j = 1,\ldots,\nu$, and

  $A_i(x) \ne QA_j(x)Q^t$

for $i \ne j$ and any $Q \in O_n$.

Proof.  Assume  to  the  contrary  that  the  matrices  A^ (x) ,  •••,A^(x)  and 
A(x)  are  of  the  form  (10.1),  their  characteristic  polynomial  is  p(X,x)  and 
any  of  the  matrices  are  not  strictly  orthogonally  similar.  Choose  Xq  as  in 
Theorem  10.31.  We  assume  that 

A(Xq)  =  Aj(XQ)  =  D,  j  =  1,*»«,v 

where  D  is  a  diagonal  matrix.  We  first  note  that  if  ^  ^^n 

T^  ^  T^  then 

for  some  k.  Otherwise  Theorem  10.31  imply  lhat 


A.  =  T,T2AjT,T2. 

As  T^T^  *  ±1  it  easily  follows  that  there  exists  a  permutation  matrix  such 
that 


PAjP-  =  Aj,  9  Aj2. 
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Therefore 


p(X,x)  =  ©A.^l  =  |Ai^-A. J IXI^-A.^I 

and  this  is  impossible  since  p(X,x)  is  irreducible. 

Also for any $T_1, T_2 \in DO_n$,

$\omega_k(A_i,T_1) \neq \omega_k(A_j,T_2), \quad i \neq j,$

for some $k$. Otherwise $A_i(x)$ and $A_j(x)$ are strictly orthogonally equivalent, which contradicts our assumptions. Let

$\omega(A_i,T) = (\omega_1(A_i,T),\dots,\omega_s(A_i,T)), \quad s = n(n-1)m.$

Then the set $\{\omega(A_i,T)\}$, $i = 1,\dots,\nu$, $T \in DO_n$, contains $\nu 2^{n-1} = 2^{(n-1)nm}$ distinct vectors. As $\omega_k(A_i,T) = \pm 1$, $k = 1,\dots,n(n-1)m$, there are only $2^{n(n-1)m}$ possible vectors, so each of them occurs. We deduce that there exists $i$ such that $\omega_k(A_i,T) = 1$, $k = 1,\dots,n(n-1)m$. Clearly $\omega_k(A,I) = 1$, $k = 1,\dots,n(n-1)m$. Thus Theorem 10.31 implies that $A(x)$ and $A_i(x)$ are strictly orthogonally equivalent, which contradicts our assumptions. The proof of the theorem is completed.
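The counting step that closes this proof can be checked by direct arithmetic. The Python sketch below is an illustration added for the reader and is not part of the original argument; the function name `sign_vector_count` is hypothetical. It assumes, as in the proof, that $DO_n$ contributes $2^{n-1}$ elements up to sign and that each vector $\omega(A_i,T)$ has length $s = n(n-1)m$ with entries $\pm 1$.

```python
def sign_vector_count(n: int, m: int) -> dict:
    """Bookkeeping for the counting step in the proof of Theorem 10.34."""
    s = n * (n - 1) * m                  # length of each vector omega(A_i, T)
    nu = 2 ** ((n - 1) * (m * n - 1))    # bound stated in Theorem 10.34
    classes = 2 ** (n - 1)               # elements T of DO_n counted up to sign
    return {
        "s": s,
        "nu": nu,
        "vectors_from_nu_matrices": nu * classes,
        "all_sign_vectors": 2 ** s,
    }


for n, m in [(2, 1), (3, 1), (4, 1), (3, 2)]:
    counts = sign_vector_count(n, m)
    # nu matrices already use up every +-1 vector of length s, so one
    # more matrix A(x) cannot have a vector of its own.
    assert counts["vectors_from_nu_matrices"] == counts["all_sign_vectors"]
```

The equality $\nu 2^{n-1} = 2^{s}$ is what leaves no room for the extra matrix $A(x)$.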
11.  Conclusions, remarks and open problems.


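Let $S_{n,m+1}$ be the space of $m+1$ symmetric tuples $(A_0,\dots,A_m)$:

(11.1)    $S_{n,m+1} = \underbrace{S_n \times \cdots \times S_n}_{m+1}.$

Thus $S_{n,m+1}$ can be identified with $\mathbb{C}^{(n+1)n(m+1)/2}$. The complex orthogonal group $O_n$ is acting naturally on $S_{n,m+1}$:

(11.2)    $U[(A_0,\dots,A_m)] = U(A_0,\dots,A_m)U^t = (UA_0U^t,\dots,UA_mU^t), \quad U \in O_n.$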


To each $m+1$ symmetric tuple $(A_0,\dots,A_m)$ we associate a polynomial symmetric matrix $A(x) \in S_n(\mathbb{C}[x])$ by means of the formula (10.1). Consider the characteristic polynomial $|\lambda I - A(x)|$ given by (10.4). Clearly $p_{ij}(A_0,\dots,A_m)$ is a polynomial on $S_{n,m+1}$ which is invariant under the action of $O_n$. So $p_{ij}(A_0,\dots,A_m) \in \mathbb{C}[S_{n,m+1}]^{O_n}$. The coefficients of $|\lambda I - A(x)|$ induce the map

(11.3)    $p : S_{n,m+1} \to P_{n,m+1}, \quad p = (p_{ij}), \quad i = 1,\dots,n, \ j = 0,\dots,mi,$

where $P_{n,m+1}$ is the coefficient space of the polynomials of the form (5.1).
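The invariance of the characteristic polynomial under the action (11.2) can be illustrated numerically. The following Python sketch is an addition for illustration only. It assumes that (10.1) reads $A(x) = A_0 + A_1x + \cdots + A_mx^m$, and it builds a complex orthogonal matrix by a Cayley transform of a complex skew-symmetric matrix, which is one convenient choice and not a construction taken from this report. All function names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)


def random_symmetric(n):
    """Random complex symmetric (not Hermitian) n x n matrix."""
    z = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return (z + z.T) / 2


def random_complex_orthogonal(n, scale=0.3):
    """Cayley transform of a complex skew-symmetric K, so that U U^T = I."""
    k = scale * (rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n)))
    k = (k - k.T) / 2
    eye = np.eye(n)
    return (eye - k) @ np.linalg.inv(eye + k)


def char_poly_value(tup, lam, x):
    """Value of |lam*I - A(x)| with A(x) = sum_i A_i x^i (assumed form of (10.1))."""
    n = tup[0].shape[0]
    a_x = sum(a * x**i for i, a in enumerate(tup))
    return np.linalg.det(lam * np.eye(n) - a_x)


def coefficient_count(n, m):
    """Number of coefficients p_ij, i = 1..n, j = 0..m*i."""
    return sum(m * i + 1 for i in range(1, n + 1))


n, m = 3, 2
tup = [random_symmetric(n) for _ in range(m + 1)]
u = random_complex_orthogonal(n)
moved = [u @ a @ u.T for a in tup]          # the action (11.2)

samples = [(0.7 + 0.2j, -1.1), (2.0, 0.5j), (-0.3 - 1.0j, 1.4 + 0.3j)]
for lam, x in samples:
    assert np.isclose(char_poly_value(tup, lam, x), char_poly_value(moved, lam, x))
```

Here `coefficient_count(n, m)` returns $mn(n+1)/2 + n$, the number of coordinates of the map $p$.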

Denote by $\delta(p(A_0,\dots,A_m))$ the discriminant of $|\lambda I - A(x)|$ given by (5.9) ($r = m$). We claim that $\delta \not\equiv 0$ on $S_{n,m+1}$. The proof is identical to the proof of Theorem 6.5 except that instead of the pencil $A + xB$ we have to consider the polynomial matrix $A + x^mB$. Next we claim that the transcendence degree of $\mathbb{C}[S_{n,m+1}]^{O_n}$ is $mn(n+1)/2 + n$. Indeed, assume that $A_0$ has pairwise distinct eigenvalues. So there exists an $m+1$ tuple $(D,E_1,\dots,E_m)$ orthogonally similar to $(A_0,\dots,A_m)$ such that $D$ is a diagonal matrix. The matrices $E_1,\dots,E_m$ are fixed up to the action of the discrete group $DO_n$. Thus the tuples $(D,E_1,\dots,E_m)$, which depend on $n$ parameters for $D$ and $n(n+1)/2$ parameters for each $E_k$, parametrize most of the orbits in $S_{n,m+1}$, so the transcendence degree of $\mathbb{C}[S_{n,m+1}]^{O_n}$ is at most $mn(n+1)/2 + n$. An obvious modification of Theorem 7.4 yields


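Theorem 11.4. The polynomials

(11.5)    $\operatorname{tr}(A_0^i), \ i = 1,\dots,n, \qquad \operatorname{tr}(A_0^iA_kA_0^jA_k), \ 0 \le i \le j \le n-1, \ k = 1,\dots,m,$

form a transcendence basis in $\mathbb{C}[S_{n,m+1}]^{O_n}$. Moreover, if $A_0$ has $n$ distinct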

2 

eigenvalues  then  the  values  of  these  polynomials  determine  at  most  2'"^’^  n+2)/2 

distinct  orbits. 
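The trace polynomials of (11.5) are easy to evaluate, and both their number and their invariance under the orthogonal action can be checked on random data. The Python sketch below is an added illustration under the reading $\operatorname{tr}(A_0^i)$, $i = 1,\dots,n$, and $\operatorname{tr}(A_0^iA_kA_0^jA_k)$, $0 \le i \le j \le n-1$, $k = 1,\dots,m$. The helper names are hypothetical, and the Cayley transform used to produce a complex orthogonal matrix is a convenience of this sketch, not a construction from the report.

```python
import numpy as np
from numpy.linalg import matrix_power

rng = np.random.default_rng(1)


def random_symmetric(n):
    """Random complex symmetric n x n matrix."""
    z = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return (z + z.T) / 2


def random_complex_orthogonal(n, scale=0.3):
    """Cayley transform of a complex skew-symmetric matrix."""
    k = scale * (rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n)))
    k = (k - k.T) / 2
    eye = np.eye(n)
    return (eye - k) @ np.linalg.inv(eye + k)


def trace_invariants(tup):
    """Values of the polynomials (11.5) on the tuple (A_0, ..., A_m)."""
    a0, rest = tup[0], tup[1:]
    n = a0.shape[0]
    powers = [matrix_power(a0, i) for i in range(n + 1)]
    values = [np.trace(powers[i]) for i in range(1, n + 1)]
    for ak in rest:                      # k = 1, ..., m
        for i in range(n):               # 0 <= i <= j <= n-1
            for j in range(i, n):
                values.append(np.trace(powers[i] @ ak @ powers[j] @ ak))
    return np.array(values)


n, m = 3, 2
tup = [random_symmetric(n) for _ in range(m + 1)]
u = random_complex_orthogonal(n)
moved = [u @ a @ u.T for a in tup]

inv_before = trace_invariants(tup)
inv_after = trace_invariants(moved)
```

With $n = 3$ and $m = 2$ the vector has $n + mn(n+1)/2 = 15$ entries, and `inv_before` agrees with `inv_after` up to rounding.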

Theorem  10.34  implies 

Theorem 11.6. The polynomials $p_{ij}(A_0,\dots,A_m)$, $i = 1,\dots,n$, $j = 0,\dots,mi$, given by (10.4) form a transcendence basis in $\mathbb{C}[S_{n,m+1}]^{O_n}$. More precisely, if $\delta(p(A_0,\dots,A_m)) \neq 0$ then the values of these polynomials determine at most $2^{(n-1)(mn-1)}$ distinct orbits.

Proof. As the transcendence degree of $\mathbb{C}[S_{n,m+1}]^{O_n}$ is $mn(n+1)/2 + n$ it is enough to show that $p_{ij}(A_0,\dots,A_m)$, $i = 1,\dots,n$, $j = 0,1,\dots,im$, are algebraically independent. That is, the map $p$ given in (11.3) is proper. Suppose that $p$ is not proper. Then for "most" of $\pi \in p(S_{n,m+1})$, $p^{-1}(\pi)$ is a variety of dimension

$\frac{n(n-1)}{2} + 1 = (m+1)n(n+1)/2 - [mn(n+1)/2 + n] + 1$

at least. Choose $(A_0,\dots,A_m) \in S_{n,m+1}$ such that $A_0$ has pairwise distinct eigenvalues and $\delta(p(A_0,\dots,A_m)) \neq 0$. According to Theorem 10.34 there are at most $\nu$ distinct orbits satisfying $p(B_0,\dots,B_m) = \pi = p(A_0,\dots,A_m)$. As the eigenvalues of $B_0$ are pairwise distinct, there are only $2^n$ orthogonal matrices which commute with $B_0$. So the orbit of $(B_0,\dots,B_m)$ under the action of $O_n$ is of dimension $n(n-1)/2$, the dimension of the connected component of $O_n$. Therefore $p^{-1}(\pi)$ has dimension $n(n-1)/2$ for such $(A_0,\dots,A_m)$. Clearly, the set of $m+1$ symmetric tuples $(A_0,\dots,A_m)$ such that $\delta(p(A_0,\dots,A_m)) \neq 0$ and $A_0$ has pairwise distinct eigenvalues is an open (algebraic) set in $S_{n,m+1}$. So for "most" of $\pi \in p(S_{n,m+1})$, $p^{-1}(\pi)$ is a variety of dimension $n(n-1)/2$. The above contradiction proves that $p$ is a proper map. The proof of the theorem is completed.
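The dimension count behind this contradiction is elementary and can be tabulated. The Python sketch below is an added illustration with a hypothetical function name; it only restates the arithmetic of the proof, namely that the dimension of $S_{n,m+1}$ minus the number of polynomials $p_{ij}$ equals $n(n-1)/2$, the dimension of $O_n$.

```python
def dimension_bookkeeping(n: int, m: int) -> dict:
    """Dimensions appearing in the proof of Theorem 11.6."""
    dim_space = (m + 1) * n * (n + 1) // 2    # dimension of S_{n,m+1}
    n_coeffs = m * n * (n + 1) // 2 + n       # number of polynomials p_ij
    orbit_dim = n * (n - 1) // 2              # dimension of O_n
    return {
        "dim_space": dim_space,
        "n_coeffs": n_coeffs,
        "orbit_dim": orbit_dim,
        # fiber dimension when the p_ij are algebraically independent
        "fiber_if_independent": dim_space - n_coeffs,
        # lower bound on the fiber dimension if they were dependent
        "fiber_if_dependent_at_least": dim_space - n_coeffs + 1,
    }


for n in range(2, 7):
    for m in range(1, 5):
        d = dimension_bookkeeping(n, m)
        # independent case: a generic fiber has the dimension of one orbit
        assert d["fiber_if_independent"] == d["orbit_dim"]
        # dependent case: a generic fiber would be strictly larger than an orbit
        assert d["fiber_if_dependent_at_least"] == d["orbit_dim"] + 1
```

For $n = 4$, $m = 1$ this gives a space of dimension 20, 14 coefficients, and orbits of dimension 6.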




We  conjecture 


Conjecture 11.7. The map (11.3) is an onto map.

The  results  of  Sections  7-9  confirm  the  conjecture  for 

n  <  4  and  m  <  2.  Theorem  11.6  yields 

(11.8)    $\deg p \le 2^{(n-1)(mn-1)}.$

Problem  11.9.  Find  the  degree  of  the  map  p  given  by  (11.3). 

Next  we  observe  that  the  results  of  Theorem  10.34  apply  to  a  larger  class 
of  matrices. 

Theorem 11.10. Let $(A_0,\dots,A_m) \in S_{n,m+1}$ and consider $A(x) \in S_n(\mathbb{C}[x])$ given by (10.1). Assume

(i) for each $\zeta \in \mathbb{C}$, each eigenvalue of $A(\zeta)$ is either simple or double,

(ii) each eigenvalue of $A_m$ is either simple or double.

Then the values of the polynomials $p_{ij}(A_0,\dots,A_m)$, $i = 1,\dots,n$, $j = 0,\dots,mi$ (i.e. the characteristic polynomial $|\lambda I - A(x)|$) determine at most $2^{(n-1)(mn-1)}$ distinct orbits.

Proof. The assumptions of the theorem imply that $p(\lambda,x) = |\lambda I - A(x)|$ is nondegenerate. Moreover for each intersection point $\zeta$, $A(x)$ is orthogonally similar to $\bigoplus_{j=1}^{t} A_j(x)$, i.e. $V$ in (10.15) belongs to $O_n(H(D(\zeta,r)))$, where each $A_j(x)$ is either a $2 \times 2$ or a $1 \times 1$ symmetric matrix. Suppose that $A_j \in S_2(H(D(\zeta,r)))$. Since $p(\lambda,x)$ is not degenerate, $A_j(x)$ has two distinct eigenvalues $\lambda_\alpha(x) \neq \lambda_\beta(x)$. According to Lemma 10.23, $A_j(x)$ can be analytically similar to a finite number of matrices of the form (i)-(iii), and this number can be bounded by using the Puiseux series of the eigenvalues of $A(x)$ at $x = \zeta$. Therefore we have a finite number of classes $\mathcal{A}_j$ of $A(x)$ of the form (10.1) such that

(i) all of them have the same characteristic polynomial $p(\lambda,x)$,

(ii) if $A(x), B(x) \in \mathcal{A}_j$, then $A(x)$ and $B(x)$ are orthogonally similar in the neighborhood of any $\zeta$ (finite or infinite).

Now, the arguments in the previous sections show that in each $\mathcal{A}_j$ there is only a finite number $A_1(x),\dots,A_{k_j}(x)$ such that $A_\alpha(x)$ and $A_\beta(x)$ are not strictly orthogonally similar for $1 \le \alpha < \beta \le k_j$. So $p_{ij}(A_0,\dots,A_m)$, $i = 1,\dots,n$, $j = 0,\dots,mi$, determine a finite number of distinct orbits. The inequality (11.8) combined with Theorem 3.17 implies that the number of distinct orbits determined by $p_{ij}(A_0,\dots,A_m)$, $i = 1,\dots,n$, $j = 0,\dots,mi$, is at most $2^{(n-1)(mn-1)}$. ■

In many physical applications $A_0,\dots,A_m$ are symmetric and real. That is, we restrict ourselves to $S_{n,m+1}(\mathbb{R})$. In that case the real orthogonal group $O_n(\mathbb{R})$ is acting naturally on $S_{n,m+1}(\mathbb{R})$. In particular each orbit $\operatorname{orb}(A_0,\dots,A_m)$ is compact. We pose the following problem.

Problem 11.11. Let $(A_0,\dots,A_m) \in S_{n,m+1}(\mathbb{R})$. Does the characteristic polynomial $|\lambda I - \sum_{i=0}^{m} A_ix^i|$ always determine a finite number of orbits? What is the value of this number?

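As in Section 5 let $M_{rd}$ denote the variety of reducible polynomials in $P_{n,m+1}$, and let

(11.12)    $S_{rd,n,m+1} = p^{-1}(M_{rd}).$

If $p$ were an onto map then Corollary (5.22) would yield

(11.13)    $\operatorname{codim} S_{rd,n,m+1} \le m(n-1).$

Problem 11.14. Find $\operatorname{codim} S_{rd,n,m+1}$, and $\operatorname{codim} S_{rd,n,m+1}(\mathbb{R})$ in $S_{n,m+1}(\mathbb{R})$.

Here $S_{rd,n,m+1}(\mathbb{R}) = S_{rd,n,m+1} \cap S_{n,m+1}(\mathbb{R})$.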

In Friedland-Simon [1981] we showed

(11.15)    $\operatorname{codim} S_{rd,n,2}(\mathbb{R}) \le n-1$

and we conjectured the equality sign in (11.15). We proved this conjecture for $n = 2,3$. We now point out briefly how to prove this conjecture for $n = 4$:

(11.16)    $\operatorname{codim} S_{rd,4,2}(\mathbb{R}) = 3.$

Assume that $(A_0,A_1) \in S_{4,2}(\mathbb{R})$ and let $p(\lambda,x) = |\lambda I - (A_0 + xA_1)|$. Assume first that $p(\lambda,x)$ is not degenerate. Then Theorem 9.6 implies that there is only a finite number of orbits $\operatorname{orb}(A_0,A_1)$ with the characteristic polynomial $p(\lambda,x)$. Recall that in this case $\operatorname{orb}(A_0,A_1)$ is a real variety of dimension $6 = 4 \cdot 3/2$. Assume next that $p(\lambda,x)$ is degenerate. It was shown in Shapiro [1979] that for $n \le 4$ the Kippenhahn conjecture [1951] is valid.
Conjecture 11.17. (Kippenhahn). Let $A, B$ be $n \times n$ Hermitian matrices. Assume that $A + xB$ is a degenerate pencil. Then $A$ and $B$ have a common non-trivial invariant subspace. That is, there exists a unitary matrix $U$ such that

$U^*(A + xB)U = (A_1 + xB_1) \oplus (A_2 + xB_2).$

In case that $A, B$ are real symmetric, $U$ can be chosen to be a real orthogonal matrix.
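The decomposition in the conclusion of the conjecture has an elementary consequence that is easy to verify: when a real symmetric pencil is orthogonally similar to a direct sum, its characteristic polynomial factors. The Python sketch below is an added illustration of this easy direction only; it does not test the conjecture itself and does not use the report's definition of a degenerate pencil. The helper names and the $2 + 2$ block sizes are assumptions of the sketch.

```python
import numpy as np

rng = np.random.default_rng(2)


def real_symmetric(k):
    """Random real symmetric k x k matrix."""
    g = rng.normal(size=(k, k))
    return (g + g.T) / 2


def direct_sum(a, b):
    """Block diagonal matrix with blocks a and b."""
    k, l = a.shape[0], b.shape[0]
    out = np.zeros((k + l, k + l))
    out[:k, :k] = a
    out[k:, k:] = b
    return out


def pencil_char_poly(a, b, lam, x):
    """Value of |lam*I - (a + x*b)|."""
    return np.linalg.det(lam * np.eye(a.shape[0]) - (a + x * b))


a1, b1 = real_symmetric(2), real_symmetric(2)
a2, b2 = real_symmetric(2), real_symmetric(2)
q, _ = np.linalg.qr(rng.normal(size=(4, 4)))     # a real orthogonal matrix

# A pencil that hides its block structure behind an orthogonal similarity.
a = q @ direct_sum(a1, a2) @ q.T
b = q @ direct_sum(b1, b2) @ q.T

samples = [(0.3, -1.2), (2.0, 0.7), (-1.5, 3.1)]
for lam, x in samples:
    whole = pencil_char_poly(a, b, lam, x)
    parts = pencil_char_poly(a1, b1, lam, x) * pencil_char_poly(a2, b2, lam, x)
    assert np.isclose(whole, parts)
```

The matrices `a` and `b` look generic entrywise, yet their pencil has a reducible characteristic polynomial because of the hidden common invariant subspace.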

Thus, it follows that $p(\lambda,x)$ in this case also determines a finite number of orbits $\operatorname{orb}(A_0,A_1)$ whose dimension is 6 at most. Since the codimension of $M_{rd}(\mathbb{R})$ in $P_{4,2}(\mathbb{R})$ is 3 we obtain the equality (11.16).

Also the above arguments show that Problem 11.11 has a positive answer for $m = 1$ and $n \le 4$.